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TWO REMARKS ON EXTREME FORMS 
MARTIN KNESER 


1. Introduction. The following remarks concern two different parts of the 
theory of extreme quadratic forms. In §2, I shall give a new proof for the 
theorem of Voronoi (4), which asserts that a form is extreme if and only 
if it is both perfect and eutactic. (For the definitions see, e.g., Coxeter (2) 
or the text below.) There is indeed a comparatively simple proof in Bachmann's 
Zahlentheorie, 1V, 2. For two reasons, however, it may not be useless to 
communicate another proof here. First, I shall prove the necessity and the 
sufficiency in one step, and second, as a consequence, my proof requires a 
minimum of calculation. In §3, I shall add a new senary extreme form to 
Hofreiter’s list (3) and Coxeter’s paper (2). This form was found independently 
by Barnes (see the succeeding paper (1)). I therefore restrict myself to 
determining its group of automorphs, which is not done in Barnes's paper. 


2. Voronoi’s theorem. In this section, we denote: real symmetric 
(m X n)-matrices by capital letters A, B,..., the 4$n(m + 1)-dimensional 
space of these matrices by R; vectors, considered as (m X 1)-matrices, by 
small letters x, y,...; the transposition of a matrix by a prime, and real 
numbers by Greek letters. Let x’Ax be a positive definite quadratic form, 
and uw(A) the minimum of x’Ax taken over all x with integer coordinates not 
all 0. This minimum is attained at a finite number of vectors, called minimal 
vectors; for these we shall reserve the letter m. The form x’Ax is called extreme 
if the quotient 

5(B) = p(B) det(B)-'” 


has a local maximum at B = A, ie., if 6(A + C) < 6(A) for all matrices 
C € R whose elements are sufficiently small. Since 6(A B) = 6(B), we may 
restrict C to a hyperplane through the origin not containing A, e.g. the 
hyperplane H of R defined by tr(A~'C) = 0. Continuity shows that, for small 
C, A + Cis positive definite and the minimal vectors of A + C are contained 
among those of A. Hence 5(A + C) < 6(A) means that for at least one 
minimal vector m of A, the inequality 


(1) m'(A + C)m.det(A + C)-'" < m'Am.det(A)-™ 
holds; or, what is the same, the union of the regions K,, of C’s such that 
A+ C is positive definite and (1) holds, contains a neighbourhood of the 
origin O in R; or, finally 

(a) U,HC\K, contains a neighbourhood of O in H. 
Now, the following lemma shows that K,, is convex and H (/)/ K,, is strictly 
convex. 
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LemMA. Let m be a fixed vector and } > 0. The set K of positive definite 
symmetric matrices C satisfying the inequality m'Cm < d det(C)'™ is a convex 
cone with its vertex at the origin O. If H is a hyperplane not containing O, H (\ K 
is strictly convex (i.e., if C and D are in H(\K and if 0< 7 <1, then 
tC + (1 — r)D is an inner point of H(\ K). 


Proof. We have to show that if C and D are in K, so is rC + (1 — r)D, and 
that the latter is an inner point unless D = «C. Since K obviously is a cone, 
we may replace D by a scalar multiple and so may assume 


m'Cm = m'Dm = m'(rC + (1 — r)D)m. 
It remains to prove that the inequalities 
det(C)!” > uw = m’'Cm.dr—',_ det(D)'*>u 


imply det(rC + (1 — 7r)D)'” > uw with equality only if D = C. This is well 
known. It is proved by transforming C and D simultaneously to diagonal 
form and then applying the inequality 


I] (ry; + (1 — 7)5,)'" > I] v"+(1- ane 6. 


Moreover, K,, possesses a tangent hyperplane at the origin. This will be 
shown by developing (1) into a power-series in the elements of C and taking 
the linear terms only. In point of fact, write (1) in the form 


1 + m'Cm p(A)—" < det(E + A-'C)'™. 


For the half-space determined by the tangent hyperplane and containing K,,, 
we then obtain the linear inequality 


m'Cm w(A)* < + tr(A-C). 
Restricting C to H, we obtain 
ln(C) = m'Cm < 0. 
Next we show that (a) is equivalent to 
(b) The half-spaces /,,(C) < 0 cover the whole space H except O. 


First suppose (a) holds. Let C # O be any point of H. It follows from (a) 
that there exist an m and a \ > 0 such that AC is in L, = H(\ K,,. Since 
L,, is strictly convex, uC is an inner point of L,, if 0 < u < A, and thus 


Im (C) = wl, (uC) < 0, 


which proves (b). Conversely, suppose (b) holds. Let S be the unit sphere in 
H. For every C € S there is an m with 1,,(C) < 0. Since J, = 0 is the tangent 
hyperplane of L,, at O, there exists a \ > 0 such that AC is an inner point of 
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L,,. Then there is a neighbourhood U of C on S such that wD € L,, for every 
D € Uand 0 < u < X. As S is compact, it is covered by a finite number of 
U’s. Denote by A» the minimum of the corresponding A’s. Then the solid 
sphere of radius A» is contained in U,. Lm, which proves (a). Transforming (b) 
into a statement about the complementary half-spaces 1,,(C) > 0, we get 


(c) O is the only point of H common to all the half-spaces 1,,(C) > 0. 


Now, let H’ be the space of linear forms on H, and M C H’ the convex set 
generated by O and the /,,. Since the elements / of any hyperplane in H’ 
passing through the origin satisfy an equation /(C) = 0 with some fixed 
C # O, the statement (c) means that there is no hyperplane through O in 
H’ leaving M on one side. Since M is convex, a neighbourhood of O in H’ is 
contained in M. This assertion can be divided into two parts. First, H’ is the 
linear space generated by M, i.e., M is not contained in any hyperplane 
in H’. This means that there is no hyperplane passing through O and the /,,. 
Second, O is an inner point of M relative to the linear space generated by M, 
i.e., there are numbers po, p, > 0 with sum 1, such that O = po.0 + > palm. 


We may multiply this equation by an arbitrary positive number, and so 
the condition that the sum of the coefficients is 1 may be dropped. We have 
the following two statements which, together, are equivalent to (c). 


(d) The equations /,,(C) = 0 have no common solution C = O in H. 


(e) There exist positive numbers p, such that > pala(C) = 0 for all C 


in H. = 
Since H is defined by tr(A-'C) = 0, (e) is equivalent to 
(2) > pm m'Cm = o tr(A~'C) for all C in R, 


with some constant ¢. Putting C = A we get ¢ > 0. If C = x.x’ we obtain 
(3) > pm(m'x)® = o tr(A~ xx’) = o tr(x'A~'*x) = o x'A~'x. 


This means that A is eutactic. Conversely (2) follows from (3), since every 
symmetric matrix C is a linear combination of matrices of the type xx’ (this 
is nothing else than the well-known theorem that every quadratic form is a 
linear combination of squares of linear forms). Finally, (d) and (e) also imply 
that the equations m’Cm = 0 have no common solution in R, except C = O. 
For, if C is a solution, (2) implies C € H and then C = O according to (d). 
This means that A is perfect and so we have proved 


VorRONO!’s THEOREM. A positive definite quadratic form is extreme if and only 
af it is both perfect and eutactic. 


3. The new senary extreme form. The senary form 


3 6 2 
x"Ax = + (x4 — XXir3 + x44) + (= «,) 


t=1 t=—1 











148 MARTIN KNESER 


with the matrix 


211411 

121141 

A=|11211}3 

11211 

141121 

1134112 
has determinant 2—*.3°.13 and minimum 2. There are 21 pairs +m, of minimal 
vectors. First, the pairs +m, (i = 1, ... , 6) of unit vectors with ith coordinate 
+1, all others 0; second, 12 pairs +m, (i =7,...,18) with x = 1,x,;= —1 
(k = 1 mod 3) and x, = 0 if j ¥ k, 1; third, three pairs +m, (i = 19, 20, 21) 
with coordinates obtained from x; = x4 = 1, x2 = x5 = —1, x3 = x5 = 0 by 


permuting the values 1, —1, 0. That A is perfect is seen by an easy calculation. 
That it is eutactic follows from the identity 


6 18 21 
(4) 39 y'Ay = 62 (m'Ay)* + 50 (m'Ay)* + 72 (m'Ay)* 


which, by the substitution y = A-'x, yields a formula of type (3). The 
representation (4) of y’Ay as a linear combination of the squares of the linear 
forms m‘Ay is unique. This fact is important for the determination of the 
group of automorphs of A, i.e., of those unimodular matrices U which trans- 
form A into itself: U'’AU = A. At the first glance there is a group G of 48 
automorphs, namely those permutations of the variables x,, which change 
each pair (x,, X,43) into a pair of the same kind. We shall show that G com- 
bined with +£ is the whole group of automorphs of A. Obviously every U 
permutes the minimal vectors and preserves the scalar products m!Am,. 
Now U transforms the representation (4) into another of the same kind, with 
Um, instead of m,. Since (4) is unique and the first coefficient, 6, is different 
from the two others, 5 and 7, U permutes the pairs +m, (i = 1,..., 6) 
amongst each other. This permutation is such that any quadruple +m,, 
+m ,,; changes into one of the same kind, because m! Am, = 1 or #1 accord- 
ing ast # k ori = k mod 3. Hence we can write every automorph as a product 
of an element of G by an automorph V with Vm, = +m, (i = 1,..., 6). 
As m! Am, ~ 0, the sign must be the same for all i and so V = +E. Therefore 
the total group of automorphs is the direct product of G and the cyclic group 
+E of order two. 

As we have seen, the group of automorphs is not transitive on the minimal 
vectors, contrary to most other known extreme forms (2), nor is it irreducible. 
The irreducible subspaces are: One of dimension one, generated by 

6 
, m+; 
i=1 
one of dimension two, generated by m9, m2, m2; one of dimension three, 
generated by m; = m, — m,4,m_ = M2 — ™5,nN; = mz — mg. The representation 
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of G in the third subspace is isomorphic. G induces all transformations of the 
form n,— +n,, i—k being any permutation and + any combination of 
signs. So G is isomorphic to the extended octahedral group. This completes 
the determination of the structure of the group of automorphs. 
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NOTE ON EXTREME FORMS 


E. S. BARNES 


Let f(x1,...,%) = Dasa, be a positive definite quadratic form of 
determinant D = |a,,|, and let M be the minimum of f for integral x1, ... , x, 
not all zero. The form f is said to be extreme if the ratio M"/D does not increase 
when the coefficients a,, of f suffer any sufficiently small variation. 

All extreme forms in m variables are known for nm < 5. Hofreiter (2) investi- 
gated the problem of finding all extreme forms in 6 variables and listed four 
forms; but, as is pointed out by Coxeter (1), one of these (F;) is certainly 
not extreme. Coxeter (1) actually finds independently four extreme forms 
(including three of the four listed by Hofreiter) and makes the reasonable 
suggestion that the list is now complete. 

The main purpose of this note is to show that there is an extreme form in 
6 variables not given by these authors, namely 


(1) Ff (x1... » Xe) = (> x.) + L, ols Xa43)s 


where generally 
o(x,y) = x* —xy+y’, 
for which 
13-3° 


M=2,D=-3-. 


The form (1) is the particular case nm = 2r = 6 of the form 
n 2 r n 
(2) f (x1, ne | Xn) = (> x.) + } o(x,, Xr43) + >» X., 
t=1 j=l k= 2r+1 


where n > 2r > 2 (and the last sum is empty if m = 2r). I show here that 
the form (2) is extreme if and only if 


(3) 4r —2>n>2r>6. 
We first examine all integral sets x;,...,,, not all zero, for which 
(4) f <2. 


Noting that 
=Oif (x,y) = (0,0), 
(5) o(x, y)\ = Lif +(x, y) = (1,0), (0, 1) or (1, 1), 
> 3 otherwise, 
Received October 1, 1954. The author would like to thank Professor Coxeter and Professor 


Mordell for their comments on an earlier draft of this note. 
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we see that (4) implies that 


(6) (xX 4, Xr43) < 1 Gj = Eo secglDs 
and that equality can hold in (6) for at most two values of j. 
If, say, &(%1, Xr41) = O(X2, X42) = 1, 6(xy, X45) = O (7 > 3), then 
Xy = X4,7= 0 (3 <j<r), 
and (4) requires 2f = 2, x, = 0 (k > 2r + 1), x1 + x2 + X41 + X42 = O. 
Using (5), we see that all possible sets (x;, x2, X41, X-42) are +(1, —1, 0,0), 
+(1,0,0, —1), +(0, 1, —1,0), +(0,0, 1, —1), +(1, —1,1, —1). 
If only one $(x;,%,4;) is non-zero, say (x1, %,41), then x, = x4, = 0 
(2<j7 <7) and Xo417 +... +%,? ¢ 1. There are thus two possibilities: 
(i) Xorg =~. - = Xe =O, fF = Bey? + BWo41® + Kix 41, 
and trivially f > 2, with equality only if 
(x1, X41) = + (1,0), + (0, 1); 
(ii) x, = + 1 for some k > 2r + 1, 
f = (1 + X41 + x)? + 2, 
and so f > 2, with equality only if 
(%1, X p41, Xe) = +(1,0, —1), +(0, l, —1). 
If finally all $(x,, x,4,) are zero, then 
n 2 n 
f= ( p ss) + y Xe", 
k=2r+1 k=w2r+1 


and it is easily seen that f > 2, with equality only if some x, is +1 and the 


rest zero, or if some pair (x,,x;) = +(1, —1) and the remaining x, are 
zero. 


We have thus shown that M = 2. Also, if with each pair + (x,°",.. . x,°) 
of minimal vectors (i.e., sets with f = 2), we associate the linear form 
A; = Ar(¥1, eeey Yn) = d xy, 
we see (using the obvious symmetries of f) that there are precisely 
(7) s = n(n + 1) + 4r(r — 3) 
such linear forms, given by 
ys (l<i¢n), 
(8) ao (l<ej<ken,kArt+j; j=l,...,r), 
Yi — Ym + Veet — Vrtm (lgel<mc¢r). 


If r > 3, it is easily verified that f is uniquely determined by the fact that 
it has minimum 2 and associated linear forms (8); thus f is perfect (in the 
sense of Voronoi (3)). If however r < 3, (7) gives s < $n(m + 1) and sof 
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cannot be perfect. We have therefore shown that f is perfect if and only if 
, > @. 

Now Voronoi (3) has shown that a form is extreme if and only if it is 
perfect and eutactic. We therefore consider next the problem of deciding 


when f is eutactic, that is to say, when its adjoint F(y, ..., ¥,) is expressible 

as 

(9) F(y1,- +59) = Le pds px > 0 i eee 
= 


The labour of calculating F (and the determinant D of f) may be lightened 
by using the following method: 


The form 
(10) g(21,.--, 20) = (arti +... + ante) + . 2; 
is easily found to have determinant 
(11) Die) =1+ Lav, 
and adjoint a multiple of 
(12) G(s:,...,2) = Lewes; 
with 
C4 = 1+ > an’ Cy = —agy; (i ¥ j). 
ket 


Under the linear transformation T defined by 


aos 


x3 
fm i,...,9), 
egy = (2/ V 3) En43 G ”) 
Xp = BZ, (k = 2r+1,...,n), 


f in (2) is reduced to the form (10) with 


(13) a,=1 (1 <ig¢r,27+1<cicn), a, = V3 (r +1 <i < 27), 
so that 


(14) 1+ Sag =n+2+1. 

t=1 
Since T has determinant (2/+/3)’, it follows from (11) and (14) that f has 
determinant 


(15) D = (%)'(n + 2r + 1). 


Finally, a straightforward multiplication of matrices now shows that 
F(yi,..- 59a) is a multiple of Sb, yw, with 
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b a 1<i<2r, 
** " 13(n + 2r), i > 2r, 
—12, l<i<j<2,j-—is#¥yr, 
(16) by, = }2(m + 2r — 5), l<i<j<2,j-i=r, 
th, i < 2r,j > 2r, 


Corresponding to the enumeration (8) of the associated linear forms, we 
write (9) as 


(17) F(y1,.-- 59a) = Lede + Loply; — yx)? 
+ Lr im(y: — Vm + 7 Vrom)*, 


where the suffixes have the ranges given in (8), and solve (17) for the 
s = 3n(n + 1) + $r(r — 3) coefficients p;, os, Tim- 
First, we have immediately 


(18) on = —b» = 3, k>j> 2r, 
(19) ox = —b» = 6, k > 2r >}. 


The coefficient of 2yy, for 1 <7 < k < 2r7,k —j # ris —cn —Tim, where 
l=jorj—r,m = kor k — 1; hence we have 


(20) Cx = Ci rtke = Crtjrte = 12 — T jks 1 <j « k < Tr. 

The coefficient of 2y, ¥,4,, for 1 <7 <1, is 

(21) Ti 4 eee a Ty~1,9 + T4941 ok eee a Te = 2(n oa 2r = 5) 
(j=1,...,7f). 


The coefficient of y,? is 


Pit out... omt tiat... + tir 
= pi t+ (12 — riz) +... + (12 — tip) + (12 — 712) +... + (12 — 71) 
+6(m — 2r) + 712+... 4+ 71 
pi + 12(2r — 2) + 6(m — 27) — (rin +... + 717) 
pi + 4n + 8r — 14, 


using (19), (20) and (21); since },, 
The same argument gives 


(22) pi = 6 L<étecS 


The coefficient of 2,41? is 


4(n + 2r — 2), it follows that p; = 6. 


P2r+1 + O71 2r+1 + eee + O27 ,2r+1 + O2r+1,274+2 + 20 | T2r+1: 
= por41 + 6(2r) + 3(m — 2r — 1), 


using (18) and (19); since 52,412.41 = 3(m + 2r), it follows that p24; = 3. 
The same argument gives 


(23) p, = 3, 2r+1<icn. 
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Now (20) and (21) give 
12 +... tory = (12 — rz) +... + (12 — T1,) 
12(r — 1) — 2(m + 2r — 5) 
2(4r —1—n); 


if the o,, are all strictly positive, this shows that n < 4r — 1. Thus f is not 
eutactic if n > 4r — 1. 


If, however, m < 4r — 2andr > 3, we can show that f is eutactic by taking 
the particular solution 


(24) a 2(n + 2r — 5) 





T im (1 ql<m¢ r) 
r—l 
of the r equations (21). Then (20) gives 
(25) on @ 12 —- ete —5 _ Me —i-s), 4 





r—l1 r—1 
for all relevant j, k, and we have exhibited a solution of (17) in which all the 
coefficients p;, oj, Tim are positive. 

We have now established our assertion that f is extreme if and only if (3) 
holds. In particular, we have shown that the senary form (1), for which 
n = 2r = 6, is extreme. 

The form (2) gives some information on the possible structure of perfect, 
eutactic and extreme forms, as well as extending Coxeter’s table (1, p. 439) 
of extreme forms for each n > 6. 

Thus Coxeter remarks (1, p. 396): “For every known perfect form in less 
than nine variables there is a solution [of (9)] with the p’s all equal.”” However, 
for the form (2), there is no such solution for any m > 2r > 6. This assertion 
is clear if m > 2r, from (22) and (23); if m = 2r, it follows from (24) and (25), 
since equality of the r’s and o’s would require 


4r—5=2r-—1,7r=2. 


Coxeter also remarks (1, p. 392): ““We do not know whether every perfect 
form is extreme.” The form (2), however, is perfect and non-eutactic (and so 
not extreme) for any r > 3, m > 4r — 1. 
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ON INTEGERS n RELATIVELY PRIME TO /(n) 
JOACHIM LAMBEK and LEO MOSER 


1. Introduction. If m and m are two integers chosen at random, the prob- 
ability that they are relatively prime (2, p. 267) is 6x-*. This result may still 
hold when m and mn are functionally related. Thus, Watson (3) recently 
proved that for a irrational, the positive integers m for which (m, [an]) = 1, 
have density 6x~*. A different proof of a slightly more general result was given 
by Estermann (1). The present authors found that the number of positive 
integers not exceeding x, with (n, [n*]) = 1, is 6r-*x + O (xt log x). In this 
paper we generalize the latter result. Roughly speaking, if f(1), f(2),...isa@ 
non-decreasing sequence of non-negative integers, tending slowly to infinity, and 
if the intervals over which f(m) = n increase slowly with n, then the probability 
that n be relatively prime to f(n) is 6x-*. 


2. Notation. As usual, let [a] denote the largest integer not exceeding a, 
and let (m, m) be the greatest common divisor of m and n. Small Roman letters 
usually denote positive integers. Let f = {f(1), f(2),...} be amy sequence of 
non-negative integers; then we define as follows: 

Q,(x) is the number of n < x for which (n, f(n)) = 1. 

If 

lim x~*Q,(x) 
exists, we denote it by P,, and call it the probability that n and f(n) are relatively 
prime. 

R, (x; a, b) is the number of multiples of a, not exceeding x, which are mapped 
onto b by f. 

S, (x; a, 6) is the number of multiples of a, not exceeding x, which are mapped 
onto multiples of b by f. Usually, the suffix f in the functions defined above will 
be omitted. 

f*(n) denotes the number of m such that f(m) = n. 


3. Preliminaries. We assume at the outset that 
(i) f is non-decreasing. 
The following relations between the functions defined are then immediate: 


(3.1) S(x;a,b) = >>!" R(x; a, kb), 
k=0 

(3.2) S(x;1,1) = x, 

(3.3) f(b) = R(@;1, 5). 


We now prove several lemmas. 


Received July 26, 1954. 
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LEMMA 1. fz) 


Q(x) = 2 #(d) S(x;d,d). 


Proof. The Mébius function yz has the following property (2, p. 234): 


(3.4) 2d »(d) =1 (m= 1), 2d »(@) =0 (n> 1). 
Hence, - 
O= DV i=D DY w@ = La S(x;4,¢), 
@. fedmt ngz d\(n, f(n)) d=1 


since S(x; d, d) is the number of m < x such that d|m and d|f(m). The fact that 
f is non-decreasing ensures that the last summation need not be carried beyond 


f(x). 
LemMMA 2. |R(x;a,b) — a~' R(x; 1, 5)| < 1. 


Proof. By (i), the set of R(x; 1,5) numbers mapped on 3b by f consists of 
consecutive integers. The R(x; a, 6) multiples of a mapped on } form a subset 
whose neighboring elements differ by a. The required result now follows from 
the fact that every set of a consecutive numbers contains exactly one multiple 


of a, while every set of fewer than a consecutive numbers contains at most one 
multiple of a. 


Lemma 3. |S(x; a,b) — a~! S(x;1, b)| < b- f(x). 

Proof. Replace 6 by kb in Lemma 2 and use (3.1). 

We now assume 
(ii) f* is finite and non-decreasing. 

LEMMA 4. . a 

0< LH Ok) — LP < ss). 
Proof. The result follows by summing over k the obvious inequalities 
f* (bk) > b-*(f* (bk) + f*(bk — 1) +... + f*(bk — b+ 1)) 

” f* (bk) < b-*(f* (bk) + f* (bk + 1) +... + f*(bk + 6 — 1)). 

Lemma 5. S(x;1,6) — b-' S(x;1, 1) = O(f*(f(x))). 

Proof. We observe that in (3.1) all terms of the sum, with the possible 


exception of the last, are unaltered by replacing x by ~. The last term is 
O(f*(f(«))). Hence, using (3.3), we have 
[d-* s(z)) * 
2 R(@;1, kb) + OF (f(@))) 
[o-* s(z)] 


> fe) + Of (f(@))). 


k=0 


(3.5) S(x; 1, 5) 
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Also, putting 6 = 1 in (3.5) and absorbing the last f(x) — b[b-' f(x)] terms in 
the error term, we obtain 
b[d-* s(2)] 


(3.6) bS(x;1,1I) =o" DY fe) +040 ))). 


k=0 


The required result now follows from (3.5), (3.6), Lemma 4 with 
s = [b-' f(x)], and the fact that f*(b[b-' f(x)]) < f*(f(x)). 


4. Results. We proceed to estimate S(x; a,b) and Q(x). Consider the iden- 
tity 
(4.1) S(x; a,b) = (ab)—' S(x; 1, 1) + (S(x; a, 6) — a S(x; 1, d)) 
+ a (S(x; 1,6) — b-' S(x; 1, 1)). 


Using (3.2) and Lemmas 3 and 5, we obtain from (4.1) that 


(4.2) S(x; a,b) = (ab)-'x + O(6 f(x)) + O@ f*(f(@)). 
It is well known that 

(4.3) > d* = logr + O(1) 

and 

(4.4) > u(d) d* = 6x + O(r"). 


Using Lemma 1, (4.2) with a = } 


d, (4.3) and (4.4), we obtain 


THEOREM 1. [f (i) f is non-decreasing and (ii) f* is finite and non-decreasing, 
then 


Q(x) = 6x-*%x + O(f(x) log f(x) + O(f*(f(x)) log f(x) + O(xf(x)-). 
Example 1. f(x) = [x4], f*(x) = 2x + 1, Q(x) = 627% + O(x! log x). 
More generally, if k is an integer >1, and f(x) = [x’*], then 

Q(x) = 6x-*x + O(x'"* log x). 
An easy consequence of Theorem 1 is 


Tueorem 2. [Jf (i) and (ii) hold, as well as (iii) f(x) log f(x) = o(x) and 
(iv) f*(f(x)) log f(x) = o(x), then P, = 6x-. 


Proof. By Theorem 1 and the definition of P, it suffices to check that 
xf(x)-! = o(x), that is, f(x) —+ ©, and this is a consequence of (i) and (ii). 


5. Discussion. Clearly P; is unaffected by changing the value of f(x) on 
any set of zero density. Thus one can easily construct functions for which 
P, = 6x-* but none of (i) to (iv) hold. On the other hand, none of the 
conditions (i) to (iv) are superfluous for Theorem 2, and they are therefore 
independent, as may be seen from Examples 2 to 5. 
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Example 2. f(2x) = 2[(x/2)*], f(2x + 1) = f(2x) + 1. Here f*(2x) = f*(2x 
+ 1) = 2x + 1. Only (i) is violated. However, since (m, f(m)) > 2 for m even, 
Q(x) < [4x], and if P exists then P < 4 < 627°. 


Example 3. f(x) = 2{x*]. Only (ii) is violated, but again (n, f(m)) > 2 for 
nm even, so that P + 627”. 


Example 4. f(x) = x. Only (iii) is violated, but clearly Q(x) = 1 and P = 0. 


Example 5. f(x) = [logio x]. Only (iv) is violated. Let x = 10*"*! and con- 
sider all m = 107° + 2s < x, s > 1, so that (m, f(m)) > 2. Their number is 
$(10?"+! — 10") = 0.45%. Hence Q(x) < 0.55x, and P ¥ 6x~*. Actually it is 
not difficult to see that for this f, P; does not exist. 
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ON SOLUTIONS OF x* = 1 IN SYMMETRIC GROUPS 
LEO MOSER anp MAX WYMAN 
1. Introduction. Several recent papers have dealt with the number of 


solutions of x* = 1 in S,, the symmetric group of degree n. Let us denote this 
number by A,¢ and let A,» = 7,. Chowla, Herstein and Moore (1) proved: 


1.1 T, = Tae + (nm — 1)Ty-2 7, 27,21 
1.2 nt < T,/Te-1 <n? +1 
[4n) 

1.3 T, = nS 1/{2°j!(m — 2j)!} 

L j= 
1.4 pm T,x"/n! -_ et 

n=0 
(n/e)"e™ 

1.5 Be ~ . 


In §2 we establish a connection between 7, and the Hermite polynomials. This 
will enable us to refine 1.5 and to prove a conjecture concerning 7,/7,-: 
made in (1). 

Jacobsthal (3) showed that for p a prime 


1.6 > Aap x’ /n! = t?””, 
n=0 
This was generalized by Chowla, Herstein and Scott (2) who proved 


1.7 > Asc x*/n! = exp >. x*/k. 
n=0 kid 


The problem of finding asymptotic formulae for A, was proposed in (2). 
In §3 we obtain asymptotic formulae for 7, and A,,, by a method which also 
yields results in the general case. 

It was pointed out in (2) that Frobenius’ theorem implies 
1.8 A,» = 0 (mod p) 
and this, together with an explicit expression for A, , constitutes a generaliza- 
tion of Wilson’s theorem. Some other arithmetic properties of 7,, were devel- 
oped in (1). In §4 we obtain still another generalization of Wilson’s theorem, 
and also further arithmetic properties of T,. 

In §5 we show how some corresponding results can be obtained for the 
number of solutions of x* = 1 in alternating groups. 

Received June 16, 1954. 
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2. T,, and the Hermite Polynomials. The recurrence 1.1 is similar to that 
satisfied by the Hermite polynomials, namely 
2.1 Hi, (x) = 2x Hy-1(x) — 2(nm — 1) H,-2(x), Ho(x) = 1, Hi(x) = 2x. 
In fact, we prove the following theorem: 


* Joh 
2.2 T, = BE, fo-< @ 


Since 2.2 is readily verified for » = 0 and m = 1 we may proceed by induction. 
Using the induction hypothesis and 2.1 we obtain, 

Th-1 + (n oat 1)T,-2 

— Ha-s(i/2") , (nm — 1)Hy-2(i/2*) _ Ha(i/2') 

—_ gr igh@—D jr 2h) _ 72" 


which completes the proof. 








From 2.2 we can obtain the asymptotic expansion of T,. The asymptotic 
expansion of H,(x) is given by Szegé (4, p. 194) to be 


p—1 
2.3 dn e *"'H, (x) = cos(N*x — 3nx) >> U,(x) N~’ ’ 
v=0 
p-1 
+ Nsin(N'x — 4nx) >> V.(x) N~’ + exp{— N*|4(x)|} O(n), 


v=0 


where N = 2n + 1 and 








_ Tia+1) _ T(n+2) ; 
= Tan +1) % *~ FGn+3/2) 

according as is even or odd. The coefficients U,(x) and V,(x) are polynomials 
depending on v; they contain only even and odd powers of x, respectively. 
The first two terms of this expansion yield 


T'(n + 1) 
I'(3n + 1) 


Using 2.2, 2.4 and the asymptotic expansion of the gamma function we obtain 
the theorem 


4 
ina 7 
2.5 rw tlhe (14 244...). 


This is a refinement of 1.5. A numerical check of 2.5 and a still more accurate 
formula for T,, derived in §3 will be given later in the paper. 
We next consider R, = T,/T7,-:. By 1.1 we have 


2.4 H, (x) ~ e*” (cos(N'x — 4nr) + ix*N™ sin(N*x — in7)). 


2.6 Rays = 1 + n/R,. 
Iterating 2.6 gives the continued fraction expansion 
27 Rueidd- n—-1l1n—2 1 








+ te if} ***h° , 
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We now use elementary means to sharpen the bounds for R, given in 1.2. 
Let a, be the positive solution of x? — x — n = 0, i.e., 


2.8 a, = (1 + $(4n + 1)4). 
We prove 
2.9 An-1 < R, < Ay. 


Proof. We proceed by induction over #. The result is trivial for m = 1. 
Assume it true for » = K, then 
Rr+i =l1+ K/Rx >il+ K/ax = Gr 
Also, 
Revi = 1 + K/Re <1 4+ K/org-s. 
It remains to show that 
1+ K/agei < oxy 


and this follows from 2.8 and simple algebraic manipulation. An easy conse- 
quence of 2.9 is 





2.10 lim (R, — n') = }. 
In (1) it was conjectured that 

2.11 R, ~ nt + A+ Bn? + Co +... 

for appropriate constants A, B, C,.... We shall prove this conjecture and 

obtain the following theorem 

2.12 R,~n'+4—im'+.. 
Proof. Let us consider 

2.13 fa(x) = xH,(x)/H,-1(x). 

From 2.2 and 2.13 we have 

2.14 R, = —fu(i/2'). 

However, it is well known that 

2.15 Hi(x) = 2n H,-1(x). 

Hence, 

2.16 fla) @ 92 OE) a 


* H.(x) (log Ha(x)) ° 


If we restrict x to be pure imaginary, say x = it, t > 0, then the expansion 
2.3 takes the form 


; p-l p—1 
2.17 Hy(x) ~ pret ite-™ ‘| > U(x) N~” + aa V,(x) v-| . 
v=() 7 van 


From 2.15, H,’(x) is known to have an asymptotic expansion of the form 
2.3. Hence differentiation of 2.3 is justified and leads to 
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, mt. Se a , 
2.18 (log H,(x))’ ~ x — iN? + 2d, a(x) N +aad Q,(x) N~ 


where again, g,(x) and Q,(x) are polynomials in x. Since these polynomials 
contain only even powers of x, and odd powers of x respectively, the coeffi- 
cients of the various powers of N~’ will be real in 2.17 and pure imaginary 
in 2.18. Hence for x = i/23, 








: 2'n 
2.19 fali/2*) ~ 3 —N't 6,4+BN +... 
where §;, 82... are constants. Hence, 
4 

2.20 R~28(14+ 4424...) 
where 7:1, ¥2... are constants. Since N = 2n+1 we may expand in terms of 
n and find 
2.21 R (1 4% ) 

; n~n +++ +... 

n n 

where 4;, 52... are constants. We have calculated the first three terms to be 


those given in 2.12. 


3. Asymptotic Expansions. Previously in this paper we obtained an 
asymptotic expansion of 7, by recognizing the relationship between 7, and 
the Hermite polynomials. This method of course does not help us when con- 
sidering the more general problem involving A, 4. In this section we shall ob- 
tain an asymptotic expansion for 7, by a different method. This method not 
only applies to A, 4, but also to many other problems, some of which will be 
discussed in a later paper. 

In our method we make use of a result, indicated in the following lemma, 
which is probably known. Since we have been unable to find a reference we 
include, for completeness, a proof of the 


Lemma. Let f(z) be a function of a complex variable z regular in a neighborhood 
of z = 0. If 
(a) f(0) = 0, 
(b) the Maclaurin expansions of f(z), e/ are 
fe) = Dak, M=1+ DL de’, 
k=1 k=l 


(c) la,| < Keo", 
where K, o are positive numbers, then 


3.1 lb,| < K o* (1 + K)*". 
Proof. By Taylor’s Theorem, 


1 (d* 
- a= (SEP) 


1 d'e™™ 
$3 no B(fE) . 


cy ed 


— oOo tm oO FT OH 





ils 


of 


d 
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Since b; = a, the result is trivially true for k = 1. From 3.3 we may write 


— rod 
3.4 bess = (k +1)! + 1)! ds* aE dz saat 
Expanding the kth derivative of a product we have 








z 
3.5 bu = x drni(k — $ +1), by = 1. 
0 
Hence 
k 

3.6 lbesal < yi \b,| |ax—o+1]. 
From (c) we obtain 

k 
3.7 loses] < K oi + > |d,| -*) ; 

s=l 


From 3.7 the result follows easily by induction. 
We now proceed to outline our method for finding an asymptotic expansion 
of 7,. By 1.4 and Cauchy’s theorem, 


3.8 ao af. (e s+ie*) ods, 


" 3s: 
where C is the circle z = Re. Hence 7, can be expressed as 
3.9 T, = uf de, 
where 
3.10 H = (n! e®**”") /2eR" 
and 
3.11 (6) = R(e® —1) + 4R(e" — 1) — ind. 
If we let 
3.12 «= Rk" 
and 
3.13 [= faa 


then a simple calculation shows that |J| = O(e-®"). Since we shall show that 
our asymptotic expansion of the integral in 3.9 involves only powers of 1/R 
we may drop 3.13 and write 

€ 
3.14 T,~H | eds. 


Expanding f(6@) in a Maclaurin expansion we obtain 
3.15 f(0) = i(R® + R — n)o — @(R’ + 4R) + >» (R+2*- Rye 


We now choose R so that 
3.16 R?+R-—n=0 
and define ¢ by means of 
3.17 = 6(R* + $R)}. 
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We may then write 3.14 in the form 


3.18 TT, ™~ rf ge Orregs 

where 

3.19 c= t+ 4m! 

3.20 nary 

$31 J = H/(R* + 4R)} 
@ : k 

3.22 F(z, ¢) = _ (z + (1 + 42) *,** He) ; 
k=3 ~ 


Since ¢« = R-™4, ¢ = O(R™) and c > © as R— o. Further for any fixed 
¢, F(z, ¢) is regular in the neighborhood |z| < 2 and e?‘** will have a 
Maclaurin expansion of the form 


3.23 eT =F Ual$) 2% 


where y,,(¢) is a polynomial in ¢. Hence 


3.24 T™~ > ( f" € Wm (d)d +) 2” + R, | 


m=0 on 


where 
3.25 R, = is (x in(6)2") d¢. 
Since ¥_(¢) is a polynomial in ¢ and c = O(R"*) one can place 
3.26 } dl On (d) dd = Z €' Wnt) de 
with an error that is of exponential order. Hence we write 
3.27 T™~ > ( £ em) dd en oa R, | , 
In order to complete our proof we merely have to show that, for fixed 


s, |R.| = O(\z|*). 
If the Maclaurin expansion of F(z, ), as a function of z, is written 





3.28 F(z, ¢) = 2d 2-(¢) 2", 
then by 3.22 
is E k—-1 — | (i¢)* 
3.29 a,(¢) = ride as (¢ +2" )(1 +42) "2 wp 
By using Cauchy’s theorem for derivatives one may easily show, for |z|<1, that 
1d k-1 $k b—2 2% 
3.30 rl de? (+ 2° )(1 + 42)" <2 A 





F 


w 








at 
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Hence 
3.31 lex(o) < 5 Sle” 
From 3.31 it is easy to show by induction that 
3.32 lar($)| < (4|6|?(1 + 4/¢])’. 
Making use of our lemma with K = (4|¢|)*, ¢ = 1 + 4/¢| we have 
3.33 Wm (d)| < (4]6])?(1 + 4) o])"(2 + (4/¢/)?)""". 
Hence 
3.34 > ¥m() 2"| < (4| 9)" + ‘eb? + (419/))""Is!" 
where = 
3.35 M = 1 — (1 + 4/¢|)(2 + (4/¢})*)|eI. 


Now z= 1/R and in 3.25 |¢| < c = O(R™‘). Therefore |z| |¢|/* = O(R-”*) 
~OasR— oo, —— for sufficiently large R, M > 4. Thus one can say 


3.36 | > Vm ( o) 2" n| < P,(|¢|)\2)’, 


where P,(|¢|) is a polynomial in |¢|. From 3.25 


3.37 IR,| < (| e*’P,(\¢)) a6) js! < f e*’P,(1|) dols|’. 
Since the integral exists for each fixed s we must have 
3.38 |R,| = O(|2|*) 


for a fixed s. This completes the proof of the theorem. Hence, an asymptotic 
expansion of 7, is given by 


n' e®ti®" ! © f em ( ) d¢ 
3.39 T,~ 3 1 = m 
2aR"(R’ + 4R)'LS R 
where R?+ R—n = 0. 
We have used 3.39 to show that up to and including terms of the order 1/n 


1 -S 7 119 
_ T.~ ghm() "2 (+q0,-f+..-) 
Numerically Too is given by 3.39 to be 2.40537... X 10®. Correct to six 
significant figures T1900 = 2.40533... X 10%. 
The method outlined here can be easily generalized, and can be used to find 
an asymptotic expansion for A,,¢. For example, the first term of the asymp- 
totic expansion of A,,, is given by 


n(1—1/p) 1/ 
3.41 Aso ~ p- n) é - p > 2. 


e 
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4. Arithmetic properties of A,,. Let a permutation x, on m letters, be 
represented as a product of disjoint cycles. It will also be convenient to repre- 
sent x by a diagram in the following way: Let a unit circle have m equi-spaced 
points on its circumference, labelled 1, 2,..., #. If x takes i into j, join ¢ and j 
by a directed line segment. Thus a permutation x will be represented by a circle 
and a set of directed inscribed polygons. For example, the permutation (1253) 
(4) (67) corresponds to Figure 1. 





4 


A rotation of the circle through 2rm/n (m = 1,2,...m” — 1) leaving the 
labels fixed will, in general, yield a new permutation having the same order 
as the old one. If m = p, p a prime, the only diagrams left unaltered by a rota- 
tion of the form mentioned above will be those corresponding to either the unit 
permutation J, or the regular directed p-gons, of which there are p — 1. 
We now use these concepts to prove 


4.1 A, = 1 (mod p),d # p, A,» = 0 (mod p). 


Since A,,, is easily seen to be (p — 1)! + 1, the result may be viewed as still 
another generalization of Wilson’s theorem. 


Proof. Suppose x* = 1, d # p. Apart from x = J, all solutions (diagrams) 
must come in sets of » by rotation through 2rm/p, m = 0,1,...,p — 1. 
This proves the first part of 4.1. If x” = 1, then x = J or the diagram for x 
must consist of a directed p-gon (of which there are (p — 1)!). If we eliminate 
the diagram for J, and the p — 1 directed regular p-gons, then the remaining 
A,» — » diagrams must again come in sets of », by rotation. Thus the proof 
is complete. 


In (1) it was shown that 
4.2 Trim =T, (mod m) m odd. 
We next derive a similar theorem in which m is unrestricted, namely 


4.3 Taim = T,.T, (mod m). 





“7 
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Proof. Forn = 0 the theorem is trivial and for m = 1 it follows immediately 
from 1.1. Assuming it true for » < k we have 


Trim = Trem—i + (R +m — 1)T em—s 
= 7,-1.7, + (Rk — 1) Te-2.Tn = Tey. Tn (mod m). 


5. The alternating group. Let B,. denote the number of solutions of 
x* = 1 in the alternating group on m letters. Further, let B,. = U,. Define 
V, and W, by 
5.1 U,+ V, = T,, Un — Va = Wy. 


To study U, it clearly suffices to consider W,. The analogue of 1.1 is given by 
§.2 W, = W,-1 = (n _ 1)W,-2, Wo = WwW, = ]. 


Proof. The only elements of order two in the alternating group on » letters 
are those which are the product of an even number of disjoint transpositions 
and the unit element. Hence the number of even elements of order two which 
can be obtained from the permutations of the digits 1, 2,...,2 — 1 alone is 
U,-1. Further, since a single transposition is odd, the only other such elements 
are obtained by involving the digit m in a transposition with some other digit, 
and multiplying by any other odd permutation of order two, involving the 
remaining » — 2 digits. Their number is clearly (m — 1) V,_». Thus 


5.3 U, = Uni + (mn — 1)V,-2. 
Similarly we obtain 
5.4 Ve = Var + (m — 1) Ug. 


Subtracting 5.4 from 5.3 yields 5.2. 
Following the lines of the proof of 1.3 given in (1) we easily obtain 


5.5 > W.x"/n! =’, 
n= 
The analogue of 2.2 is given by 
9) 
5.6 W, = MO) 


Since the proof is essentially the same as that of 2.2 we omit it. The well-known 
explicit formula for H,(x) yields the following analogue of 1.4: 

hn] 
5.7 W, = nl (—1)?/27j\(n — 2j)!. 

j= 


Combining 1.3 and 5.7 yields 


{n /4] 


5.8 U, = n! >> 1/{4*(2k)!(n — 4k)!}. 
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From the asymptotic formula for H,(x) and 5.6 we obtain 
5.9 W,, ~ (n/e)* 2 e”4 cos {(n + 4)? — inr}. 


The arithmetic properties of 7, and A, 4 also have direct analogues. Thus 
we obtain without difficulty, 


5.10 Bsa = 1 (mod pb), d # p, By» = A, ». 
Finally, the analogue of 4.3 is given by 
5.11 Wrim = W,.W, (mod m). 


The following is a short table of 7,, U,, W,. 








n Ze U, W, 
0 1 1 1 
1 1 1 1 
2 2 1 0 
3 4 l —2 
4 10 4 —2 
5 26 16 6 
6 76 46 16 
7 232 106 —20 
8 764 281 — 132 
9 2620 1324 28 
10 9496 5356 1216 





In concluding we wish to thank F. L. Miksa for computing for us the exact 
values of 7, up to and including Tis2. These values have been of value to us 
in checking different forms of our asymptotic formulae. 
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THE GROUP RING OF A CLASS OF INFINITE 
NILPOTENT GROUPS 


S. A. JENNINGS 


Introduction. In this paper we study the (discrete) group ring I of a 
finitely generated torsion free nilpotent group @ over a field of characteristic 
zero. We show that if A is the ideal of I spanned by all elements of the form 


G — 1, where G € G, then 
Be ee ect Ie Ie 6 oe 


and the only element belonging to A” for all w is the zero element (c.f. (4.3) 
below). This fact enables us to topologize T in a natural way. We may then 
define for @ “‘dimensional subgroups”’ relative to the ideal A which are the 
analogues of those considered by Magnus (7; 8; 9) for a free group. In part I] 
we associate a Lie algebra with @ in a natural manner via the group ring, and 
show that @ is a subgroup of the simply connected Lie group determined by 
this Lie algebra. To some extent these last results overlap those of Malcev 
(11; 11). However, our approach differs greatly from his, since our methods 
are intrinsic in the sense that topological considerations from the theory of Lie 
groups do not intervene. 


Part I: THE Group RING 
1. Preliminary notions. We follow the notation in (1) and write 
(H, K) = H'K-"HK 


for the commutator of elements H, K of a group G, while if $, R are subgroups 
of @ then (§, R) is the subgroup of @ generated by all (H,K) with 
H€ $, K€ &, and similarly for higher commutators. The group © is nil- 
potent and of class c if the lower central series of G is as follows: 


(1.0.1) G = G6,56,5...DG6,. 5 Gai = {1} 

where G1 = (G;, G) (¢ = 1,2,...0). A series of normal subgroups 

(1.0.2) G= Ri DRK2D...D Run D Rai = {1} 

is a central series if (R;, @) SC Rus (i = 1,2,...m) and the existence of a 
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central series is necessary and sufficient tor a group to be nilpotent. We recall 
also that for any central series we have 


(1.0.3) (R,, G,) CS Ru, i,j= 5 a 
The upper central series of a nilpotent group G: 
(1.0.4) 0) = 3. a Be-1 > see a Bi = Bo = {1} 


is usually defined by taking 3:, as the centre of G, and for i > 1, 3, as the 
subgroup in © corresponding to the centre of G@/3,-_,; in the homomorphism 
© — G/ 31-1. The following equivalent definition of the series (1.0.4) is prob- 
ably well known: 


THEOREM 1.1. 3, is the largest normal subgroup of © such that 
(3, G, G,...G) = 1, 
where the number of &'s in the above is i. 
Proof. Since 3; may be defined as the largest normal subgroup for which 


(3:1, G) = 1, assume the theorem true for 3,;, and let N be any normal 
subgroup of @ such that 


(N, G, G,...G) = 1 for z factors G. 
Then 
((M, G), G,...G) = 1 
and (M, G) | 3,1 by induction. That is, R is in the centre of G modulo 3,_; 
and hence M C 3;, from which it follows that 3, is the maximal normal 
subgroup having the property (M, G, ... G) = 1 with 7 factors G. 
A group will be said to be torsion free if every element of the group #1 is of 


infinite order. Using (1.1) we now establish, in slightly more general form, a 
result due to Malcev (9, corollary 2). 


THEOREM 1.2. Let G be a torsion free nilpotent group with upper central 
series 


G = 3.) Pe ee 3,5 Ro = {1}. 
Then 3/311 is torsion free. 


Proof. Suppose G€ 3, and G ¢ 3;-1, but G* € 3, for some positive in- 
teger a: such a G certainly exists if 3,/3,_, is not torsion free. Then 


(G*,G1) € Br, 
for all G, € G, and because of the identity 
(PQ, R) = (P, R)(P, R, Q)(Q, R) 


we have 


(Gs, G;) = (G, G;)* = 1 (mod Biz). 





a 
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Using (G, G,)* instead of G* we show similarly that 
(G, Gi, G2) = (G, Gi, G:)* = | (mod 3:3), 


and finally 


(G, Gi, Go,...,Gir)* = 1 (mod Bo), 
for all Gi,...,Gi1 € G, or, since Bo = {1} and @ is torsion free, 
(G, Gi, Gs, ... Gy) = 1 


for all G,,...G 1 € @. But by (1.1) this implies that G € 3,;, which is a 
contradiction, so that our result is established. 


Coro.iary 1.3. If & is a torsion free nilpotent group, then G@/ 3, (i = 1, 2, 
...€ — 1) 4s torsion free. 


2. Finitely generated torsion free nilpotent groups. Infinite solvable 
groups with maximal condition for subgroups have been investigated by 
Hirsch (3; 4; 5), who calls such groups “‘S-groups.’’ We show first that every 
finitely generated nilpotent (and therefore a fortiori solvable) group is an 
S-group. 


THEOREM 2.1. Every finitely generated nilpotent group satisfies the maximal 
condition for subgroups. 


Proof. Let & be nilpotent and finitely generated by the elements P;, Ps, ... 
P,, and let the lower central series of G be as in (1.0.1): then by (1, Theorem 
2.81), 


G, = {Q,, Q», ee 2., Geir}, 


where Q;, Qo,...,@Q, are the various formally distinct commutators of 
weight w in P;, P:,..., P,, there being only a finite number of the Q’s since 
there are only a finite number of the P’s. Hence G,/G,,; is an abelian group 
with a finite number of generators, and the series (1.0.1) satisfies the condition 
(3, (1.11)), so that @ is an S-group, as required. 

We recall (4, Theorem 2.22) that the elements of a finitely generated nil- 
potent group @ which are of finite order form a normal subgroup §, so that the 
quotient group @/¥ is torsion free. In what follows we call any group which 
is torsion free, nilpotent and finitely generated an N-group. That such groups 
exist tollows from the above remark. 

Now Hirsch has shown also (4, Theorem 2.311) that in any finitely gener- 
ated nilpotent group @ there exist series of subgroups, each normal in G: 


(2.1.1) G©@ = %,5%9:5% D5... D4, D Arai = (D) 


with the properties 
(2.1.2) A./M 41 is either cyclic of prime order, or an infinite cyclic group. 
(2.1.3) The number of infinite cyclic factors in any series (2.1.1) satisfying 
(2.1.2) is an invariant of the group G. (4, Theorem 2.23). 
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We prove now that if @ is an N-group, we may find series (2.1.1) all of whose 


factors are infinite cyclic, and indeed such that the series itself is a central 
series: 


THEOREM 2.2. Any N-group @ has at least one central series 
(2.2.0) @ = Ff: 2D... DF, D Fai = {1}, 
such that 

(1) §o/Bea1 (@ = 1,2,... 1) is an infinite cyclic group, 

(2) (+ G©) & Fu 


The length r of any such series is an invariant of the group, which we call the rank 
of ©. 


Proof. The invariance of the rank will follow from (2.1.3). Now by (1.2), 
since @ is an N-group, the factors 3,/3,;—; of the upper central series (1.0.4) 
of @ are torsion free, and by (3, Theorem 1.33), are finitely generated, so that 
3:/ 3:-1 is a direct product of a finite number of infinite cyclic groups. We may 
therefore refine the upper central series of G so that between any two consecu- 
tive terms 3, and 3,; we have a finite chain of subgroups 


Bi > Ua a Us Ddertrad Ure, - 3-1, 
so that each factor is infinite cyclic. Since 
(3. @) G 3 i-1, (Ui; @) G Bir - Usp 
and hence the refinement forms part of a central series of G. 
Any series satisfying (1) and (2) of (2.2) will be called an §-series of the 


N-group @. It can be readily verified that a group is an N-group if and only 
if it has ¥-series. 


The following follows at once from (4, Theorem 2.312) 


THEOREM 2.3. Let G be an N-group of rank r, and let S be a normal subgroup 
of © such that @/H is an N-group of rank s. Then $ is of rank r — s, and there 
is an §-series of © such that ¥1 = . 


COROLLARY 2.4. If G has an §-series 
© = f15 FD... FD Fe = I}, 
then ©/¥ 41 is an N-group of rank i, and § 14, is of rank r — i, 
(¢ = 1,2,...,7—1). 
Let © be an N-group with §-series 
G6 = FD FD... D Fa = {1}, 


and let F, be a representative in @ of a generating element of §, modulo § ;:: 
then any element G of G may be written uniquely in the form 


(2.5.1) G =FYF;"...F,;", 








a 





INFINITE NILPOTENT GROUPS 173 


where a,...,a@, are integers, positive, negative or zero. In what follows we 
assume that an §-series, and the elements F;,..., F, have been selected once 
and for all. We refer to the elements F;,..., F, as an ¥-basis for G, and to the 
representation (2.5.1) as the §-representation of G. Because of (2.2(2)) we 
have, since (§ 1, §,) © Be, where k > max(i, j), 


(2.5.2) FURS = FSF Fini’... Fy’ 

for some k > max(i, 7) (i,7 = 1,2,...7r). In particular we observe that F, 
belongs to the centre of G, and in particular 

(2.5.3) Fe'F;' = FF" 

for all j. 


It is known that an N-group may be ordered, and indeed this follows easily 
from the existence of the §-representation (2.5.1) and the relations (2.5.2). 
We need only remark that if 


G= Fe...F", H= Fi... F*, 


we may define G < H when, for 1 < s <r, a; = 8:1, ae = Bo, ... @y—1 = Bot, 
but a, < 8,. With this lexicographic ordering, it follows from (2.5.2) that if 
G < Hthen GK < HK, KG < KH, for all K € G, from which it follows that 


(2.5.4) if G; < Hy, and G2 < Ha, then GG. < AA, 


which is the condition that the relation “‘<”’ order the group. 


3. The group ring of an N-group. For a moment, let © be any group and 
let ® be any field of characteristic 0. The (discrete) group ring T of G over ® 
vill consist of all finite sums of the form 


(3.0.1) x= D Gs, 'Z E ®, G; E G, 


with addition, scalar multiplication, and ring multiplication defined in the 
natural manner. I may of course be of infinite rank over %, but in considera- 
tions involving only a finite number of elements of [ of the form (3.0.1), we 
may assume that the summations run over the same group elements in each 
case. As usual we identify the prime subfield of @ with the rationals, and the 
unit elements of I and of @ with the number 1. Thus there will be no con- 
fusion in supposing that, in an expression such as (2.5), the rational integers 
a, belong to ®: indeed we will often write, for example, if F € G, 


Fe = 1+ a(F — 1) + a(a — 1)(F — 1)? +..., 


where a@ is a positive integer and F* is considered as an element of I. 

Let G, be any subgroup of G: then the group ring IT, of G, over may be 
considered as a subalgebra of I. If G, is normal in G, and if G@’ = G/G,, we 
may consider also the group ring I’ of @’ over . We recall (6, §4) first the 
relationship between I, T,, I’. 
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Let A be the two-sided ideal of I spanned by all elements of the form 
(G — 1),G€ @. A necessary and sufficient condition that an element x € TI, 


= D Gs, 


belong to A is that }>&, = 0, and clearly ['/A = @s0 that A is a maximal ideal 
of I’. Similarly A; is the maximal ideal of I’; spanned by the elements G,; — 1 
for all G; € G@,. The homomorphism of G onto GW’ defined by 


GGG, 


may be extended in a natural fashion to a homomorphism of [ onto I” by 
means of the mapping 


(3.0.2) x= D&G, 32x = Dt,G,G,. 


The kernel of this homomorphism (3.0.2) may be identified as follows. Let 
G,, be a representative in @ of the coset G, @, and let x = Y£, G, be written 
in the form 


x= > EasGaG ip, Gy € @: 
then the mapping (3.0.2) may be written 
x= p> EasGaGig > p> (x ts) GG 
and x — 0 if and only if 
> fas = 0 
for all a. Hence if x +0, we may write 


(3.0.3) x= > t.sGa(Gig — 1), 


and conversely. Thus, we have proved 


LemMA 3.1. The kernel of the homomorphism of T upon IY" defined by (3.0.2) 
is the ideal TA,, where A, is the ideal of T, spanned by all elements of the form 
(Gis — 1), Gus € Gi. 


Suppose next that @ is an ordered group satisfying (2.5.4). Then I contains 
no divisors of zero, and no units other than scalar multiples of group elements. 
For consider two elements x, y ¥ 0 of I; 


x= > a,G, a, ~ 0, a», + 0, 
t=1 

y= p> 8, H,, 8; ¥ 0, B, ¥ 0. 
j= 


We may suppose G; < Gz <... <G,, Hi < H, <...< H,. Then in the 
product 
xy = a18;G,H, + ... + amBrGnH, 
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we have Gil; < G.H; < G,H, (1 <i<m, 1 <j < m), so that if xy = 0, 
2:8; = Gm8, = 0, which is false, and hence xy ~ 0 for all m,n. Similarly if 
xy = 1, then m = n = 1 and af; = 1, Gi; = 1, Hy = Gr. 

In particular, since any N-group is ordered, we may state: 


THEOREM 3.2. The group ring T of an N-group © has no proper divisors of 
zero, and no units other than scalar multiples of group elements. 


4. The structure of the ideal A. From now on @ will be an N-group. Let 
A be the ideal spanned by the elements G — 1, for G € Gas in §3. Consider the 
element G — 1 € A. Because of (2.5.1), we may write 


(4.0.1) (G — 1) = (Ft Fy’... Fe’ — 1), 
and because of the identity 
(4.0.2) AB —1= (A —1)+ (B—-1)+ (A — 1)(B- D), 
we may write G — 1 as a linear combination, 
(4.0.3) G-1= p> ite 
Here the coefficients 6,,, are integers and 
(4.0.4) Top = (Fi — 1)(Fi" — 1)... (Fi — 1), 
(i;,...%,) is a subset of the integers 1, 2,...,7 with 

Let <<... he SP. 
Also ay,,..., a, are the exponents of F;,,..., F,, in (4.0.1), where for 
brevity we have written p = (4;,...,%,) and 

w= (a;,,...,@4,)- 


Note also that the summation in (4.0.3) will extend over certain subsets p and 
w determined by the (a;,...,a,) of (4.0.1) and the identity (4.0.2). 
We have also the binomial identity, for integers a: 


(A* — 1) = {1+ (A** -—1)}'""-1 


E (el) cas - 1)’, t= 1,2,... lal, 


t 


(4.0.5) 


the positive or negative exponent on the right being taken as a is positive or 
negative. 


Consider now a product of the form 


(4.0.6) (Fit — 1)"(Fit — 1)"... (Fi — 1)" 
where 1 < 4; <i2 <<... <i, <r as in (4.0.4), 5;,...,5, are positive in- 


tegers, and where any combination of positive and negative exponents may 
occur, except that of course two factors 


(F,—1) and (F;'—1) 
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do not occur in the same product. We call w = 6, +... + 4, the degree of 
such a product, and denote by 


Puts Pats «>>» Petes w2=i12,..., 


the various formally distinct products of degree w, since for fixed w there are 
only a finite number of such distinct products (4.0.6). 
Applying (4.0.5) to each of the factors 


(Fi’- 1 ) 


in (4.0.4), we readily verify that every r.,,, and hence by (4.0.3) every element 
(G — 1), may be written as a linear combination, with integral coefficients, 
of products (4.0.6). Now every element «x of A is expressible in the form 


x= >) & (G,—1), &, € ®, 


and we see therefore that any element x of A may be written as a linear com- 
bination of products of the form (4.0.6), 


(4.0.7) x= ¥1P »,», + ¥2P 0. +...¢+ Yo wore 1 € ®. 


It is readily verified that the products P,, are linearly independent: for 
suppose 
¥1P «>, + ¥2P »,°, Fi. YeP eon = 0, 7: € 9, 


where the products are distinct and all y; ~ 0. An easy lexicographical argu- 
ment similar to that used in establishing (3.2) shows that a relation of this 
type would imply one of the same kind among group elements of the form 


G,; = a... i 


where the (a1;, ...,a@,;) are all distinct, which is impossible in view of the 
uniqueness of the representation (2.5.1). We omit the details, which may be 
supplied without difficulty. Thus we have proved 


THEOREM 4.1. The formally distinct products Py», of the form (4.0.6) are 
linearly independent and form a basis for the ideal A. The representation (4.0.7) is 
unique. 


As an immediate consequence of (4.1) we have: 
CoROLLARY 4.2. A relation of the form 
Po + (F,— I)pit+...+ (F, — 1)*p, = 0 


where po, P1,..., Pn are elements of the group ring of §%p11 implies p), = 0 
(k = 0,1,...,%). 


We define the weight of any product of w factors (F%* — 1): 
(4.2.0) I] (F- b, i=1,2,...,, 











if 








INFINITE NILPOTENT GROUPS 177 


where the factors occur in any order, and where 
(F,,-—1), (F;'- 1) 


may occur in the same product, to be 


(4.2.1) W= > 2", 
where the summation is taken over the same p,, (i = 1, 2,..., w) which occur 
in (4.2.0). 


In particular, therefore, the weight W of a product 
(Fr — 1)"(FF* — 1)"... (FF — 1)” 
(a, non-negative integers) will be 
W = a2! + a2? +... + 4,2". 
Consider now the identity 
(B — 1)(A — 1) = (A — 1)(B — 1) + AB(B"'A“"BA — 1) 
(4.2.2) = (A — 1)(B — 1) + (BABA — 1) 
+ (A — 1)(B"'A-"AB — 1) 
+ (B — 1)(B"'A-"BA — 1) 
+ (A — 1)(B — 1)(B"'A-'BA — 1). 
If p; < py, we have, because of (2.5.2), 
Fo Fo For Fo = TT Fee, = py ty yt 2s. oF 


and hence by (4.0.7) 
(Fi) Fe’ Fe F2— 1) = a Yor € ®, 


where P*,, are products of the form (4.0.6) with all p > p,. Now the weight 
W(P*,.») of every product P*,, is at least 2°‘+', and since 


W((Fx — 1)(Fa — 1)) = 2% +2" < 207, 
we see, by using (4.2.2) with 
A = F;,, B = F;,, 
that 
(Fe — 1)(F** — 1) = (F#' — 1)(F2'— 1) + products P,,, 
where 
W(Por) > 2% +2". 


By repeated application of this “straightening’’ process it may be verified 
that every product of the form (4.2.0) of weight W may be expressed as a linear 
combination of products of weights no less than W in which the factors occur in 
the natural order. 
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We observe that in these “‘straightened’’ products we may have consecutive 
factors of the form 


(F,* — 1)°(F, — 1)”. 


For convenience let us denote a “straightened” product of weight W (contain- 
ing perhaps both 


a 


(Fo'- 1)", (F,—1)” 

with a, 8, # 0) by 

(4.2.3) Ow = T1 (7 - )(F, - 1)”, p= 1,2,...,9 
where 


W = > ia, + B,) 2’, 


and some of a,, 8, may be zero. 
We establish now the principal theorem of this section: 


THEOREM 4.3. For all w, A“*' 1s a proper ideal of A”, and fl A” = 0, i.e. 
AF]. 2 OF)... 
and the only element in A” for all w is 0. 
Proof. Since (F; — 1)” # 0€ A” for all w, it is clear that no power of A 
vanishes, and it will be enough to prove the following: ‘“‘the only element x 
which belongs to A” for all w is zero.’’ The proof is by induction over the rank 


of @, and we show first that the theorem is true for groups of rank 1, that is, 
when © = { F}. 


Suppose that x ~ 0 is an element of A(G), where G@ = {F}, which is in A* 
for arbitrarily large u. Now by (4.1) we may write x uniquely in the form 
x= DivdF* —1)'+ 22 4,(F - 1)’, i=1,...,9; j=1,...,m. 


If x € A* for arbitrary u, y = F’x has the same property for any integer p, 
and hence by choosing p large enough to cancel all negative powers in x above 
we may suppose that if there is an element y # 0 in A" for all wu it has the form 


y= WF — 1)" +... + w(F — 1)™, v1 € &, 
where 0 < a1 < az <... <a, and y, #0 (k = 1,...,m). Choose u > a: 
then if y € A“, y may be written as a finite sum of products with >w factors: 
I= D 18: (F* — 1)*(F — 1)", B+ Bs >u> an, 


and hence 


n(F — 19" +... + m(F — 1)" — + 8, (F" - 1)" (F — 1) = 0. 
Now 
(F*— 1)" = + F"(F-1)*, 








— 
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and hence 


(F — 1)"[yi + ¥2(F — 1)" +... + mF -— 1) 
— > (+6,) F*(F — 1)" ] = 0, 


Since there are no divisors of zero in T, we have 

[vi + (FF — 1)" +... + 8,(F — 1) oP) = 0, 
which implies 7; = 0 (mod A), since 8; + 84 — a; > 0, and hence y, = 0, 
which is contra hypothesis. 

We now assume our theorem for groups of rank < r — 1, (and in particular 
for 2), and prove it for groups @ of rank r. Suppose that x + 0 is an element 
of A(@) which is in A*(@) for arbitrary large u. By (4.1) we have a unique 
expression 

x= ys YurP wes Yur € ®, 
which we write 
x = pot (Fi —1)"pi t+... + (Fi — 1)" 


where f» is a sum of P,, lying in A(¥2) of the form 
(4.3.1) Po =[] (FS - 1)”, p> 2, 


and p;,% > 1, is a similar sum with perhaps a term containing only an element 
of &. If x belongs to A*(@) for all u, so does y = F’ . x, and we may once more 
suppose, therefore, that our x has the form 


x= got (Fi—1)%qa t+... + (Fi — 1)" qm, 


where 0 < a1 < a2 <<... <a, and gq (k = 0,...,m) is also a sum of Py, 
of the form (4.3.1) with qi, g2,..., 9 # 0 and where go may or may not be 
zero for the moment. Consider go, . . . , gx: they are all in A(¥2) since otherwise 
(4.3) is false for G/F. By our induction assumption if go ~ 0 there is an wu’ 
such that go ¢ A“ (#2); if go = 0 there is, since gq; #0, a wu” such that 
qi ¢ A“ (%2). Let w = wu’ or u” + a as go ¥ 0, or go = 0, and choose u = 2’w. 

If x € A“(G), x may be written as a sum of products of the form (4.2.0) 
pb’ (u,), each having at least u factors and, therefore, of weight u; > u: 


x =X wo] (Fe - 1) =L yu) pu), yl) € @, 


where u,; > u. Now as in (4.2.3) each of the products p’(u,) in this sum may 
be “straightened” without loss of weight so that 


x = >) Fr) Q(r), 7(v,) € &, 
where the weights v, of all Q(v,) are at least u. These products Q(v,) are of two 
types: those lying in A(}2) and those containing a factor 


(Fy? — 1)" (F, — 1)" 
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at the beginning. Denote a Q(v;) in A(#2) by Q’(v,): then each Q(v,) is either a 
Q’(v,) or of the form 


(Fr' — 1)". (F, — 1". QQ). 
Because of our choice of u, every product Q’(u,) of weight >wu contains at 


least w factors, since the weight in A(¥.) of a product Q’(u,) is of the form 
(cf. (4.2.3)): 


(a2 + B2)2' + (a3 + 83)2? +... + (a, + B,)2™", 


and if this is greater than or equal to u = 2’w, then certainly 


(a2 + B2) +... 4+ (a, + B,) > w. 
Hence each Q’(u,) is in A”(}2). Now we have 
= go +(Fi — 1)" +... =D fyi’ (ue) + Oe(Fi' — 1)*(Fi — 1)°*O')}, 

where the right-hand side is in A*(@). If go # 0 then 

go = D> ¥:Q' (ur) 
by (4.2). This is impossible, since Q’(u,) € A”(¥:), and w= wu’, where 
go ¢ A” (Fe). If go = O we have 

DL viQ' (uz) = 0 
and hence, 

(Fi — 1)"(q. + (Fi — 1) ga +... — (DO 0'() +...) = 0, 


for it follows that 6, + 8,’ > a, for all k since if, for some k, 8 + 8,’ < a 
we could remove the factor 


(F, 1)P*t"s 
and get Q’(u,) = 0. Again this implies a relation 


a= a 50’ (ix) 
where Q’(d,) has at least (u’’ + a;) — a factors and is therefore in A”’ (#2) 
which is impossible, since gq, ¢ A”’ (¥:). 

We have proved, therefore, that 0 is the only element in A”(@) for all w. 
It follows that for every element x ~ 0 € A there exists a w such that x € A”, 
x ¢ A**!. Moreover, ['/A”*' is an algebra of finite rank over . 

In view of (4.3) it is natural to introduce infinite sums into the ring T' by 
means of a “‘A-adic’’ topology. Take the set {A”} (w = 1, 2,...) as a funda- 
mental system of neighbourhoods of the element 0 in T: then a sequence 
@, @2,...,@,... Of elements of T “converges” toa € I if, for given w, there 
exists an integer NV such that m > N implies that 


(a, — a) € A”. 
Let I* be the completion of I in this topology, and let A* be the completion 


: 
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of A. Clearly we may consider I'* to be the ring of all ‘‘power series” a* of the 
form 


(4.3.2) a* = ay + >ayd,, a€ @k=1,2,..., 


where d, € A*, while A* consists of all elements a* with ay = 0. As usual, we 
identify I with its isomorphic image in I'*. We note that 


(4.3.3) G'=1—-— (G—1)+ (G— 1)? —- (G—1)*+..., 
in I*, and that, more generally, if a* ¢ A* then a* is a unit in I*. 
It is clear that we have the following: 


THEOREM 4.4. For any integer n, T/A"*! > T'*/A**+» and (1 A** = 0. 


Because of (4.4) we can work with either I'* or T in what follows. 


5. The dimensional sub-groups of ©. The concept of dimensional sub- 
group relative to the ideal A (or A*) is now natural. Let D, (w = 1, 2,...) 
be the set of all elements D, € G such that 


D. = 1 (mod A**), 
If D, = D,' = 1 (mod A*”) we have 
Deo = 1+ dy, Dy = 1+ dy; dy, dy € A™ 
and hence 
DD, = 1 + dy + de + ded, = 1(mod A™), 
while 
D*=1-d,+d,—d,t+... 
Further, if G is any element of G, we have 
G"D.G = 1+ G"d,G = 1(mod A”), 
so that D, is a normal subgroup of @. We have also 
Dz'D,'DeD, = 1 + Dz'D; (ded, — dade) = 1(mod A****), 


where D, = 1+ 4d,, d,€ A**, and hence (Dy, D,) S Dey». In particular 
(Dy, G) S Days. These results may be summarized in 


1(mod A*”). 


THEOREM 5.1. The set D,, of elements D,, of G which are congruent to 1 mod A** 
form a normal subgroup of &, and the series 


G=9,27.2... 
is a central series of © with the property (Dw, Ds) S Dur» (w, 0 = 1,2,...). 


The subgroups D, we shall call (6; 7; 8) the dimensional subgroups of &. 
Suppose D, belongs to D, but not to Dy4:. We have 


De = 1+ d, 
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where d, € A*” and d, ¢ A***', and hence 
Dz = (1 + de)" = 1 + nd_mod A™**". 
Now since is of characteristic 0, nd, # 0 (A***') for any integer m, and hence 
De € Dott 


only if m = 0. Hence if Dy # Duis, De/Dw+: is torsion free. By refining the 
dimensional series (5.1) we obtain an §-series of the form (2.2.0). It follows 
that the dimensional series of © is of finite length. We have, therefore: 


THEOREM 5.2. The dimensional subgroups of © form a central series of finite 
length © =D, 2 D22...2 Dn 2 Dir = {1}; and are such that if 
De D Duss then Dy/Dwr: is a direct product of infinite cyclic groups. 


The problem of identifying the dimensional subgroups of @ we leave to a 
later paper. However, in view of (6, Theorem 5.5), it is natural to suspect that 
they are the minimal subgroups enjoying the properties indicated in (5.2), 
and this is indeed the case, as we shall prove. 


Part II: THe Liz ALGEBRA OF AN N-GROUP 


6. The Campbell-Hausdorff Formula in [*. For the rest of this paper we 
work with I'* and A*, since the possibility of infinite sums, which was incon- 
venient in most of Part I, is now essential. 

Let x* be any element of A*: then we may form 

¥o *, 
(6.0.1) expe) =1lts +5 +...45 +... 
where the series on the right certainly converges in the A-adic topology of $4 
to an element of I'*. Clearly X* = exp x* is a unit in I*, and 


(6.0.2) exp (ax*) exp (8x*) = exp (a + 8)x* 


for all a, 8 € &. We note that if exp x* = 1, then x* = 0, for (exp x*) — 1 


= x* u = 0, where 
* ¥o 


selto+s+... 


is a unit in I*. 
Similarly, if y* € A* we may define 
* . * * . 

(6.0.3) log(l+y)=y —ty +4y'—-by‘+..., 

where again the series on the right converges to an element of A*. It is clear 
that exp log (1 + y*) = 1 + y* and log exp x* = x*. In general, of course, 
exponentials and logarithms will not be defined for all elements of I'*, since 
for example the existence of exp (a + x*) would imply the existence of 
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expa in ®. In particular, however, log G is defined for all elements G € G, 
and we have 


(6.0.4) g = logG = (G — 1) — (G — 1)? +..., 
G = exp g. 
Now with A4* we may associate the Lie algebra A* = (A*),; in the usual 


fashion by defining the binary operation of commutation in (A*); by means of 
x* oy* - xty* = y*x*, 

for all x*, y* € A*. Clearly A* is of infinite rank over ®. Let 6* be an ideal of 

A*, and let M* = (6*),; then M* is an ideal of the Lie algebra A*. In particular 

we have 

(6.0.5) A* = (A*); D (A**); D (4*9),D.... 

Let us define the “lower central series” of A* by setting A* = A*,, and 


A*.4; = A*, 0 A*, where A*, o A* is the ideal spanned by all elements of the 
form /*, o /* with /*, € A*, and /* € A*. Then certainly 


* *y 
Ay C (A dn w=1,2,... 
so that 


(6.0.6) Ee “bo Fp: Pee Te Tae 


and the only element belonging to A*,, for all w is zero. Hence A* is ‘‘generalized 
nilpotent”’ in the usual sense. We note also 


Lemma 6.1. Jf [*,€ A*, (k = 1,2,...), then the infinite series 
G4+64...4¢6 4+... 


is an element of A*. 


Proof. Since [*,€ A*, C (A**),, /*,, considered as an element of A*, 
belongs to A**, so that the series in (6.1) converges to an element in A* and 
therefore is an element of (A*); = A*. 

Now as we have seen in §4, if X = 1 + x*, Y = 1 + y*, where x*, y* € A*, 


then 

XY = 1+ x* + y* + x*y* = 1+ 2*, 

XO =1l—2x*4+ x —x*¥?4+...=214+%, 
where 2*, @* belong to A*, so that the set ot all elements of the form 1 + x* 
forms a group @* under multiplication. Indeed, G* is a normal subgroup of 
the unit group of I, consisting of all elements of the form a + x*, where 
a #O0€ # and x* € A*. The group ©* is generalized nilpotent, for if D*, is 
the normal subgroup of @* consisting of elements of the form 1 + x*,, where 
x*, € A*, then 


(6.1.1) G =D:5D:5D;Dd... 
is a central series of G*, and 
(6.1.2) D. > G., w= 1,2,..., 


where @*,, is the wth term of the lower central series (1.0.1) of @*. Clearly, 
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therefore, G*,, + G*,,,; for all w, and the only element common to all the G*, 
will be the unit element. In particular © itself is a subgroup of @*. 

We recall now the well-known Campbell-Hausdorff formula (2; 9; 10) 
which reveals the intimate connection between the group @* and the Lie 
algebra A*. Let X = 1 + x*, Y = 1 + y* be elements of @*: then by (6.0.1): 
(6.1.3) X =expx, Y=expy, XY = expz, 
where x = log (1 + x*), y = log (1 + y*) and z = log (1 + x* + y* + x*y*). 
Then the Campbell-Hausdorff formula gives z explicitly in terms of x and y 
as follows (2, formula 26; 9, formula 10): 


(6.14) z=x+y+}(xoy) +79 (xeovyoy) +2 (yoxox) 
+x(yoxoxoy)+..., 


where xo y = xy — yx as usual, where we have written, for example, 
xoyoy=(xoy)oy, yoxroxoy = ((yox) ox) oy, etc., and where the 
right side of (6.1.4) is an infinite sum of the type considered in (6.1) and 
therefore convergent. All the coefficients on the right are known to be rational 
and therefore belong to any @ of characteristic 0. 

For later use we note that, if X-'Y-'X Y = (X, Y) = expec, then 


(6.1.5) c= (xoy) + 3(xovyox) + 3(xoyoy)t+..., 
where the right side is again an infinite sum of commutators in x and y. For 
convenience we introduce the notation 


x[t1, 12, ees is] = X%1,0%1,0..-O0%4,. 

From (6.1.5) it follows that, if X; = exp (x,) and if (Xi, X2,..., X,) = expe, 
then 

(6.1.6) Cn = (%10%20...0%:) + >, ly, w=n+i1,n+7,... 
where each /, is a rational linear combination of simple commutators of the 
form x[i1, 42,...,%»] with w >m-+ 1, and where i, i2,...,% is a permu- 
tation of the integers 1, 2,..., with repetitions allowed, but with each of 
1, 2,...m occuring at least once. 


7. The Lie algebra of an N-group. Consider now the module & of A* 
spanned by all elements of the form g = log G, where G runs over the N-group 
@. The principal theorem of this section is to the effect that 2 is a Lie algebra 
of finite rank r over ©, where r is the dimension of @. As a first step towards 
this result we prove: 


LEMMA 7.1. If w>c, the class of G, and if g; = log G,, i= 1,2,...w 
where G;, Go,..., Gy € G, then 


£10£20...0 8» = 0. 
Proof. By (6.1.6) we know that, if w > c, 


1 = (Gi, Go, ..., Gu) = exp (g10220.--0280 + Lh), 
u=w+i,wt+2,..., 


' 
f 
i 
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where 1, is a linear combination of commutators of the form g[i;, iz, ... , tx], 
so that 

(7.1.1) £10£20-.--0f6 = —Lh, u=-w+i,w+2,.... 
Now every element of the form g[i:, iz, . . . , i,] € A*™, so that (7.1.1) implies 
that every commutator gi0g:0...0g.€ A***'. In particular, every 
gli, i2,.. +» ty] € A**, and again (7.1.1) implies that g: 0 g2 0... 0 Ze € A****. 


Continuing in this way we can show that 
£10220..-.08~€ A**t’ 


for all values of NV, and therefore g: 0 g2 0... 0 Z« = 0, since zero is the only 
element common to all A”**”, 

It follows from (7.1) that the Lie algebra generated by all g = log G is nil- 
potent of class c. We prove now 


LeMMA 9.2. Let Fi, F2,..., F, be amy §-basis for @ as in (2.2), and let 
f: = log F, (i, = 1,2,...,7r). Then if g = log G, where GE G, there exist 
rational numbers 71, Y2,.. +. Yr So that 

g= fit vfet... t+ de 
In other words, the module_L spanned by the elements g = log G for all GE © 
is of finite rank r over ®. 


Proof. By an easy application of the Campbell-Hausdorff formula (6.1.4) 
we may show that 


Fi’ Fy"... Fr" = exp(aif1) exp(aofs) . . . (exp a,f,) 


(7.2.1) 

= exp(aifi + axf2 +... +af,+ >, B,<,) | ee | 
where a,...,a@, are integers, §;, B2,...8, are rational, and ¢), Co,...,¢, are 
commutators in f;,...,f, of the form f[t;, i2,...,%,] with ¢ < c. (Note that 


because of (7.1) only a finite number of commutators c, will occur in (7.2.1).) 
Since every G may be written 


G = all ee . 
it will be sufficient to prove that, for all integers ¢, 
(7.2.2) fu0ofno---ofa =D vis j=1,2,...,7 
where the 1, ..., ¥, are rational. 


As in (4.2.1) let us define the weight of f; to be 2‘, and the weight of a com- 
mutator f[7;, 72,..., %,] to be 


W(t) = 2° +2°+...+2". 


Now because of (7.1) all commutators f[i;, 72,...,%,] with / > c vanish, and 
since i}, i2,..., 4%, are all at most 7, any commutator in the f,; of weight suffi- 
ciently great vanishes (e.g., any commutator of weight greater than r = Wp; 
so that W, is an upper bound to the weight of non-vanishing commutators, 
although by no means the least). Hence (7.2.2) is satisfied trivially for all 
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commutators f[t:, i2,...,%,] of weight >Wp». Assume therefore that (7.2.2) 


holds for all commutators of weight greater than W(t). Now by (6.1.6) and 
(7.1) we have 


(7:2.3). (Fi, Fas. .-, Fu) = exp(fi. of. 0---0fn +d, BG), 


yt > eo 
where 8,’ are rational, and each c,’ is a commutator of the form f[j:, js, . . - ju] 
of weight W(u) greater than W(t) since i;,...,%, are contained among 
Ji» jz, ++ ++Jq By our assumption, therefore, 58,’ c,’ can be expressed as a 
rational linear combination of the fi, ...,/,, and we re-write (7.2.3) in the 
form 


(7.2.4) (Fi, Fuy---s Fu) = exp(fiofi,0---ofu +2 Bis) 


(cog oe? 
On the other hand, by (2.5.2) we have 


(7.2.5) SO re = Fe“ Se <p 

where k>t—1+m, m = max(i, i2,...,%,). Consider any f, with 
s >t— 1+ ™m; the weight of f, is 2* and we have, since certainly 1; ¥ 12, 
(7.2.6) W = 2* +... +2" < 42" < 25'S" < 2. 


Now (7.2.5) may be re-written, as in (7.2.1), in the form 


(7.2.7) (Fi, Fu... Fu) = explahe +... fade + > Bic’) 

where 8,” are rational and each c,” is a commutator of the form /[k:, ke, ... , Rr] 
with k;,...,k, > k, so that the weight of each c,” is greater than W by 
(7.2.6). By our assumption, therefore, we may express each c,” rationally 
in terms of f;,...,f, and (7.2.7) may be re-written 


(7.28) (Fi, Fu,---, Fu) = exp(>d 4), Jah Bicnh 


with rational 5;. Equating the right-hand sides of (7.2.3) and (7.2.8), we see 
that (7.2.2) holds for any commutator of weight W if it holds for all commu- 
tators of weight > W, which proves that (7.2.2) is true in general. 

Since in particular we have proved that 


(7.2.9) fiofs => vinfhes 1jk= = ae 2 
where the +; are rational, we may now state our main result: 

THEOREM 7.3. The elements g = log G for all G © @ span a Lie algebra & of 
rank r over ©, where r is the dimension of ©. A basis for 2 may be taken as the set 


fi = log Fi, ...,f, = log F,, where F,, Fo,..., F, is any §-basis of G. The 
structure constants of 2 relative to a basis of this type are rational. 


8. The Lie group associated with G. Suppose now that @ is the real field. 
We show that G*, the group of all elements of the form 1 + x* with x* € A*, 
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now contains as a subgroup a real simply connected nilpotent Lie group & 
whose Lie algebra is the algebra & determined by @ as in (7.3). The N-group G 
is a subgroup of &, so that in particular we establish a theorem of Malcev 
(10, theorem 6). 

For any G belonging to the N-group @ let us define @ for real — by the 
equation 
(8.0.1) G* = exp (ég), 


and let & be the set of all such elements Gt. Because of (7.2) we may write 
(8.0.2) Gt = exp (i: fi +...+ &,f,), 


where £:,...,& are real. If Gt = exp (mfi+...+,,) then we have 
G* Gt = G‘, where 
GF = exp (fifi +... + ff) 


and 


So = Pil&i,..., &3 Bde oo 9 Bede 


p; being a polynomial in the &,,...,£&,; m,...,2, determined in the usual 
fashion by the constants 7; of (7.2.9), so that if we take (£:,..., &) as the 
coordinates of G* it is clear that % is a Lie group. Indeed (, . . . £,) are canoni- 
cal, and certainly the Lie algebra of & is £. We obviously have G as a discrete 
subgroup of &. Hence we have: 


THEOREM 8.1. The set of all elements exp (§g), where & is real and g = log G, 
G € G, forms under multiplication a real simply connected Lie group U whose 
Lie algebra is the rational Lie algebra 2 determined by &. In particular, therefore, 
every N-group © is a discrete subgroup of a Lie group U with rational Lie algebra. 
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GENERALIZED MATRIX ALGEBRAS 
W. P. BROWN 


1. Introduction. The algebras considered here arose in the investigation 
of an algebra connected with the orthogonal group.' We consider an algebra 
W of dimension mn over a field K of characteristic zero, and possessing a 
basis {e,;} (1 < i < m; 1 <j < nm) with the multiplication property 
(1) Cinq = Pili dy © K. 
The field elements ¢,; form a matrix © = (¢,,) of order m X m. It will be 
called the multiplication matrix of the algebra relative to the basis {e;,5}. 

Such algebras will be called generalized matrix algebras. If m = m and ¢,, = 
5,, (the Kronecker delta) we have a total matrix algebra. 

An element 6 of & has an expression in terms of the basis {e;,} of the form 


b = > 2, beg Cogs Diy € K. 


i=1 j=l 


The correspondence } — B = (5,;) is a one to one correspondence between 
the elements of & and the set of m X nm matrices over K. Ifa € Wanda— 
A = (a4;), then 


ab = Z Os7645 0 bycéve 
iJ D.@ 


I 
M 
-_— 
M 
by 
Roe 
> 
~ 


It follows that the product ab corresponds to the matrix A®B; ab > A®B. 
The most general change of basis of & may be effected by a transformation 
of the type 


(2) Tuy = Zz , ey” Crys ou" EK 


A=—1 g=l 
(l<it<m; 1<j <1). We use double suffix notation with lexicographic 
ordering to describe the matrix of this transformation. The element of its 
(ij)th row and (Az)th column is o,)“. The elements f,, of YU constitute a basis 


for & if and only if this matrix is non-singular. We consider however a special 
type of transformation, namely: 


(3) fu = > Saery; So € K. 
rA=1 


This may be written in the form of (2) by setting o,)* = saé,. This is the 
element of the (i7)th row and (Au)th column of the Kronecker product S x I 
where S is the m X m matrix (s,,) and J is the » X nm unit matrix. It follows 
Received November 2, 1953. 
'The algebra concerned is w,*. For definition see (1) and (2, chap. V, 5). 
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that the elements f,, in (3) constitute a basis for UM, if and only if S = (s,,) 
is non-singular. 

The nature of the multiplication rule is preserved under transformations 
of the type occurring in (3). Indeed 


fishee = > Saery 2, Spulug 
= p> S aSpuP julrg 
(x Sum) Sa 


by (3). Hence fisfog = Winf ig Where Wy =>~bySp» Relative to the new basis 


{fis}, the algebra has therefore the multiplication matrix ¥ = ®S7, where S™ 
denotes the transpose of S. 
We make a further change of basis, namely 


n 
84 = Li forrs rr, € K, 


(l<i<qm;1<j <2). Again the elements g,, of & constitute a basis if and 
only if the  X m matrix R = (r;,;) is non-singular. The multiplication rule is 
again transformed; 


£1 kre = OpSig where Oy = > rir © K. 


Relative to the basis {g,,} the algebra has the matrix 6 = R*6S". 
If ® has rank r, non-singular matrices R and S may be chosen so that the 


n X m matrix @ is 
I, 0 
o=(3 >. 


where J, is the r X r unit matrix and all other submatrices of © are zero. 
A basis such as {g,,}, relative to which the multiplication matrix has this 
simple form, will be called a special basis. While the types of transformation 
used preserve the rank of the multiplication matrix, it has not yet been 
demonstrated that the rank of a multiplication matrix is an invariant for W. 
This, however, will be obvious later. 


2. The structure of generalized matrix algebras. We now assume that a 
special basis has been chosen for &% and that the multiplication matrix @ has 
the special form of 6 above and has rank r. Suppose that an element 6 € W 
has a matrix B whose partitioned form is 


* Bs Bs) 
B= (55): 


the dimensions of the submatrices being r Xr for Bi, r Xn —r for Bo, 
m—rXrfor B; and m—rxXn—r for By. 

The radical of A consists of all elements that are properly nilpotent, i.e. 
all elements 5 of & such that for every a € U, ab is nilpotent. The matrix cor- 
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responding to the ‘th power of d is (B®)*-'B. It follows that 3 is nilpotent if 
and only if B@ is a nilpotent matrix. 

Let a and c be elements of & whose matrices are A and C respectively. 
The matrix of the product abc is the product of matrices A#BSC. Now 
suppose B, = 0. Then 


_ (1, 0\(0 Bi)(I ‘) . 
ope = (J a. B/\o 0) =* 


It follows that 5 is properly nilpotent and has index of nilpotence <3. 
On the other hand, for 6 to be properly nilpotent it is necessary that the 
matrix A B® be nilpotent for all matrices A. 


_ (A; A:\(1. 0 (3: Bs)( I. °) _ (48: 0) 
aope = (4: 4: 0 0/\B; B./\0 0/ ~ \A3B, 0 


The tth power of this matrix has (A,B;)‘ in the first submatrix position. 
It follows that } is properly nilpotent only if A,B, is nilpotent for all r Xr 
matrices A;. This can only occur if B; = 0. Hence the radical of & consists of 
all those elements b whose matrices relative to a special basis, have their 
first submatrix zero. 

A Wedderburn decomposition of & into a direct sum of a semisimple sub- 
algebra % and the radical ® is now clear. % consists of all elements 5 of W 
whose matrices relative to a special basis, have the submatrices By, B; and 
B, all zero. For such elements the mapping b — B, is a ring isomorphism of 8 
onto the total matrix algebra of degree r over K. Hence 9 is simple and r is 
an invariant of the algebra. 

We see that a generalized matrix algebra is either simple (m = nm = rank ®) 
or is non semisimple and simple modulo its radical. If the algebra is simple 
it certainly possesses an identity element. On the other hand let the generalized 
matrix algebra Y& possess an identity element e so that ea = ae = a for alla 
in &. Let E and A be the corresponding matrices. We must have E@A = 
AE = A for all matrices A. Hence E® must be the m X m unit matrix 
and ®E must be the m X m unit matrix. This can happen only if m = m and 
® is non-singular. The algebra is then simple. We restate the above result in a 

THEOREM. A generalized matrix algebra is either 

(i) simple, or 

(ii) mon-semisimple and simple modulo its radical. 


It is simple if and only if it possesses an identity element. 
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SOME THEOREMS ON MATRICES WITH 
REAL QUATERNION ELEMENTS 


N. A. WIEGMANN 


1. Introduction. Matrices with real quaternion elements have been dealt 
with in earlier papers by Wolf (10) and Lee (4). In the former, an elementary 
divisor theory was developed for such matrices by using an isomorphism 
between Xn real quaternion matrices and 2m X2n matrices with complex 
elements. In the latter, further results were obtained (including, mainly, the 
transforming of a quaternion matrix into a triangular form under a unitary 
similarity transformation) by using a different isomorphism. Certain other 
related results have also been obtained (1). Others, including Moore and 
Ingraham, have considered quaternion matrices earlier. 

The intent here is to consider how other theorems which hold for matrices 
in the complex field may hold for quaternion matrices. To do this, the iso- 
morphism in (4) is employed. First, an analog of the Jordan normal form is 
obtained; this result is closely related, of course, to the final result in (10) 
concerned with necessary and sufficient conditions for similarity of quaternion 
matrices, but here a proof is employed which depends entirely on known 
complex matrix theory, which throws light on the structure of the similarity 
transformation, and which leads in a natural way to a definition of elementary 
divisors for quaternion matrices. Next, this Jordan form is used to obtain some 
results concerning commutative matrices. In part 4, the familiar polar form 
of a complex matrix is shown to hold in the quaternion case. Next, some fur- 
ther properties of normal quaternion matrices are verified and, in the final 
section, some properties of quaternion matrices relative to unitary (quaternion) 
equivalence transformations are obtained. 


2. An analog of the Jordan normal form. Let the Xm quaternion 
matrix A be written in the form A = A,+jA2 where A; and A: are (uniquely 
determined) matrices with complex elements (where every quaternion element 
is considered as written in the form a = (a,+a2i)+j(a;+a,i) where each 
a, is real). Form the 2m X2n complex matrix 


A, — A;° 
adie ” 
2 Ax° 
(where A© denotes the matrix obtained by taking the complex conjugate of 
each element of A and, later, A©? denotes the transpose of A©). According to 


(4), the correspondence between A and A* is an isomorphism and has proper- 
ties as developed there. 
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If it is possible to show that for a given A* there is determined an Xn 
matrix J; in the (complex) Jordan normal form such that a non-singular 
matrix P and a matrix J exist so that A*P = PJ where P and J have the forms, 


respectively, 
‘o “A be | 
and ° 
2 P,° o 


then an analog of the Jordan normal form can be obtained for the quaternion 
matrix A. For it can be easily seen that the inverse of P must also be of the 


form 
ot 
Q Q,° 
so that P-'A*P = J. But according to the nature of the correspondence 
between A* and A this implies that (Q:+7Q2)(A1+jA:)(Pit+jP:2) = Ji+j-0 
or that (Q: + jQ2)A(P1+jP:) = J: where (Q: + jQ2)(P:1+jP:) = I, so 
that A is similar to a complex m X nm Jordan form J. 

Since A* is a matrix with complex elements, there exists a non-singular 
matrix P such that P-'A*P = J is the Jordan form of A* so that A*P = PJ. 
In the following steps it will be shown that a P and a J can be obtained which 
satisfy this relation and are of the desired form. 

(a) Let a:,...,@m be the distinct (complex) characteristic roots of A*. 
Then each column of P is a column vector v with 2 elements satisfying one 
and only one of the following relations: 

(i) A*v = vay, 

(ii) A*o = w+ vay, 
where w is the column vector adjacent to v on the left. All 2” column vectors 
are linearly independent and for each a, there exists at least one column of 
the type (i). 

(b) For simplicity in notation let a be a root for which 9, v2,..., 0, are 
the set of column vectors of P of type (i) relative to a. Let the column vector 
v;* be defined relative to v; as follows: If v; is a column vector whose transpose 
is the row vector 

[v11, Vai, - + » Unt» Wii, Wai, -- - Waal, 


then v,* is the column vector whose transpose is the row vector 
[—Wir, —Wai, . . . — War, O11, B21, . « - Dai). 


If 7; is not the zero vector, then v; and »,* are linearly independent, for if 
C1; + cqw,* = 0, it follows that 
(c161 a Co€2) Wer = 0, (361 + Co€2)0p1 = 0 (k = 1, 2, coos n), 


so that c,; = c, = 0. Also, if A*v; = va, it follows that A*v,* = ,*a. Let us 
consider, first, vectors of type (i). 
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Let a be real. Then »; and 9,* are linearly independent vectors of type (i) 
for a, and either exhaust the number of such linearly independent column 
vectors or there exists another, say v2, which is linearly independent of 9; and 
v,*. Form 2*; then 9, 9:*,%2, and v;* are linearly independent, for if 
C1 + Cw1* + C2 + cw2* = 0, then En,* — Ew; + Ew.* — Ew: = 0. But by 
properly combining these relations, it would follow that 


(Es¢1 + Ca€2)01 + (Esco — Cabi)0i1* + (Cabs + Crés)v2 = 0, 


so that c; = cs = 0 and so, from above, c, = c; = 0. Either 2, 0:*, v2, 72* 
exhaust the number of linearly independent vectors of type (i) for a, or they 
do not. By means of this process there is obtained a set of linearly inde- 
pendent vectors of the form 1, :*,..., 0%, %* which provide a basis for the 
vectors of type (i) corresponding to each real a. 

Let a be non-real complex. Then if the matrix P contains a set of vectors 
M1,02,...,0, such that A*x,;=va (j =1,2,...,¢), it follows that 
A*o,* = 0/*@ (j = 1,2,...,¢) (where the v/* are linearly independent since 
the v, are), that there are no other vectors linearly independent of these for 
which this is true, and that @ is also a root of A*. 

Since a, a, ..., @m, are distinct, the sets of linearly independent vectors of 
type (i) obtained in this way are linearly independent and are equal in number 
to those column vectors of type (i) in the matrix P. 

(c) Consider vectors v of type (ii); these may be written as (A* — a,J)v = w; 
and it follows that (A* — @,J)v* = w*. 

Let a be real. Let there be 2k vectors 11, v2, ..., Up, .+ +» U2 Of P of type (i) 
corresponding to a and let them be written in such an order that if there exist 
for some v, vectors 0,” of P so that 


(iii) (A* —_ al)vy = U1, 

these vectors, 71, ¥2,...,%, are written together and first in this ordering. 
Then » must be even and a set of linearly independent vectors of the above 
v, »* type can be obtained which span the same space as 2, ¥2,..., 0». For if 


(A* — al)v; = 9;, then (A* — al)v;“* = v;*, and since v; and »,* are 
linearly independent, v;“ and v,;“* are also. Either p = 2, or the process can 
be continued as before, so that p = 2g. In this way we see that there exists a 
set of linearly independent vectors, 2, 01*,... , 0g 0_", ++» Ve," (which 
form a basis for all vectors of type (i) corresponding to a) such that »,", 
y)*,..., 0. v,*, provide a basis for the space spanned by 7°", . . . , 029°” 
as taken above where v, and v,‘" are related as above. If for some of the v,‘” 
there exist v,“ in P such that 


(A* —_ al)v = v/, 


the above process can be repeated, and a set of vectors, taken notationally as 
1, »,°*, ... , v,", v,* (which span the same space as the linearly indepen- 
dent v,), is obtained which stand in the same relation tov;",9;")*, ... , 79°, 
v,‘* as the latter do to 2, 01", . . . , Ue, Ue". 
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If a is non-real complex, let 7, v2,...,»,...,%, (the vectors of type (i) 
corresponding to a) be ordered in such a way that for »;,2,...,v, there 
exist 9), vo, ..., 0," satisfying (iii). Then (A* — aJ)v,* = v,*, i = 1,2, 

.»P, and v,°*, v*,...,v,“* are such that there exists no vector, w, 
linearly independent of them such that (A* — @/)w is in the space generated 
by v;*, v2*,...,v,*. For some v,“” there may exist v,; such that 

(A* on al)v _ vi; 
in this way a set of v,“* are determined and the process is seen to be a general 
one. 

(d) By the above, a set of 2m linearly independent vectors, taken notation- 
ally as Wy, We, ..., Wp, Wi", We", ..., Wa", are obtained such that either 
A*w, = we (and so A*w,* = w,*a, for any a), or, for some w, satisfying 
A*w, = wa, there exist among the above 2m vectors certain vectors taken 


notationally as w,, w,,..., w such that 
(A* — al)w, = wy, 
(A* — al)w,? = w,, j = 2,3,...,5; 


in this case it follows that 
(A* — al)w oo" = w,*, 
(A* — al)w<” = wi” [. > oo 


It is now evident that by properly arranging the w, and w,*, a 2m XK 2n 
matrix P can be obtained such that A*P = PJ as indicated above. If in J; 
(as used there), the roots a = a + bi are such that b 2 0, then a canonical 
form has been obtained for A* and hence for the quaternion matrix A. 


THEOREM 1. Every n X n matrix with real quaternion elements is similar 
under a matrix transformation with real quaternion elements to a matrix in 
(complex) Jordan normal form with diagonal elements of the form a + bi, b 2 0. 


3. Properties of commutative matrices. According to a theorem due to 
Taber (5), if a matrix A with complex elements is non-derogatory, the only 
matrices commutative with A are polynomial functions of A. An equivalent 
theorem had been previously given by Frobenius (3, Theorem XIII). 

In order to obtain an analog for this theorem where A contains real quater- 
nion elements, let such a matrix A be defined to be non-derogatory when its 
Jordan normal form matrix (as obtained in the preceding) is non-derogatory. 

Let A and B be quaternion matrices such that AB = BA where A is non- 
derogatory. Let PAP? = J = J, +J.+...+J, be the Jordan form of A 
where: 


a, 1 om” sed ee 
0 a; 1 oe 
J,= jibe : ¢= 12, , mM, 
0 ay 1 
0 0 ay 


~—— gr 





Le —O 


MATRICES WITH QUATERNION ELEMENTS 195 


where a, # a, when i #j and a, = a, + ib, bs 2 0. Let PBP-' = B, so 
that JB, = B,J. 


LemMaA. B,; = By, + By +... + Bip where B,, has the same order as J;, 
where 





ba gg eee bn | 
| 0 ba bis 
(0 OO ba 
By, ™ | eee 
ba ge gs 
| eee 0 ba bie 
LO er 0 0 ba 


and where 

(i) af a, is real, the non-diagonal elements of B,, are quaternions while the 
diagonal elements are complex; 

(ii) if a, is non-real complex, the elements of the corresponding B,, are complex. 


The following may be noted: if a isa non-real complex element, b a quaternion 
element, and ab = ba, then b is a complex number; if a is non-real complex, 
if 8 is complex, and ¢ a quaternion element such that ac = ca + 8, then ¢ is 
complex and 8 = 0. Also no a;, above, is the conjugate of an ay. 

Let B, = (0,,), let J; be of order r X r, and consider the upper left r X r 
sub-matrix of the product JB, = B,J. The following relations result: 


ab, = by 10, 


ayb,, — Dr i-1 + b, 1, (= 2, 3, seoate 

aba + bia = Due, Se i Bexeel = i 
‘e * Fae 

Dee + Degas = Oe a + Oren, \i- ‘ters | 


If a; is real then, although the 5,, are quaternion elements, all commutative 
properties hold for these relations (as in the complex case as treated by Taber) 
and the upper left r X r matrix has the form B,, with all quaternion elements, 
in general. If a; is non-real complex, it follows from the first relation that b,; 
is complex; from this and the third relation it follows that all elements in the 
first column above 5,, are complex (and in fact, except for },;, all are 0); 
from the second relation it can be seen that all elements of the rth row of this 
submatrix are 0 except b,, which is complex. Using the fourth set of relations, 
we see that the remaining elements are complex, all necessary commutative 
properties hold, and that the submatrix has the B,,; form. B,, now has the 
required form unless, for a real a;, the diagonal elements are quaternions; if 
so, there exists a quaternion element } such that bbb = 8 is a complex num- 
ber where bb = 1. Form the m X m matrix Q = bJ, + I. where J; and J, are 
identity matrices and J; is of order r X r. Then Q-' = 67, + I, and QB,Q- 
has the form required and QJQ-' = J. 
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Let J: be of order s X s, and consider the s X r submatrix directly below 
By, in the matrix B,. Upon comparing corresponding elements of this s X r 
submatrix in the product JB, = B,J, we see that the set of following relations 
appear: 

2D 49,1 = Db r45,101, 
Dre s.c = Dees ca + Oras. rr, fo, aa 
aba + digi = Oye, i=r+1,...,r+s-—-1, 


ey = ae 2 
de + Degas = Bea + Dies, ing go 1. 


Since, for 1 # j, a, # a, and a; # &,, it follows from these relations that all 
elements of this s X r submatrix of B, are zero. In this way it can be shown 
that B, = B,, + Bz where B;, has the form given in the lemma. When B, 
is treated in like fashion, the lemma follows. 

Consider next the possibility of representing this B, as a polynomial in J; 
where J, contains only complex elements. It is evident (from the work of 
Taber or by merely considering the set of equations obtained) that it is possible 
to determine two sets, x, and x,’, i = 0,1,2,...,2 — 1, of quaternion ele- 
ments such that 


n—1 n—1 
B, = Do xJ' = > J'x?. 
i=0 i=0 


If all the diagonal elements of J are real, x,J‘ = J‘x,; if all the diagonal ele- 
ments of J are non-real complex, all elements of B,; are complex and so are the 
x, so that again x,J‘ = J‘x,; and the same would be true if all the elements of 
B, were complex regardless of the nature of the a, in J. In these instances if 
x, = pyt,; (where p, is the real absolute value of the quaternion element 
x, and u, the related quaternion of absolute value one), then 


n—1 n—l 
B= P"B,P = > p.P(ul)P- Ps) P = > pW A' 
i=0 i=0 


where U, = P-'(u,J)P and U,A = AU, for each i. It follows that: 


THEOREM 2. If A and B are quaternion matrices, if AB = BA, and if A is 
non-derogatory with either all real or all non-real complex roots, then 


n—1 


B= > pV A‘ 


i=0 


where the p,are real, U,A = AU, for eachi, and each U, has a single characteristic 
root of absolute value one. 


4. A polar form. Every complex number has the familiar polar form 
pe“ and, as has been seen, the same is true for a quaternion. For a matrix A 
with complex elements a polar representation has been obtained when A is 
non-singular by Wintner and Murnaghan (9) and when A is singular by 
Williamson (7). It also exists for quaternion matrices according to the 
following: 


—_—- 


; 
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THEOREM 3. Every n Xn matrix A with real quaternion elements can be 
expressed as A = H,W, = W,K, where H, and K, are hermitian (quaternion) 
matrices and W, is a unitary (quaternion) matrix; if A is non-singular the 
representation is unique, and if A is singular, H, and K, are unique but W, is 
arbitrary to some extent. 


Let A = A; + jAz where A; and Az; are (as in §2) uniquely determined 
matrices with complex elements. Then A is isomorphic to A* where 


: A, -—A;° 

* — 
(i) A i? rl , 
Since A* has complex elements A* = HU = UK by (9) and (7), where H 
and K are hermitian and U unitary. Then AA“? corresponds to H? and there 
exists a unitary quaternion matrix (see (4), for example) V; = V; + jV2 so 


that V,;AA°"V;°" = D is a diagonal matrix with real elements and, conse- 
quently, if 


then VA*A*°7VCT = D + D. Since H is hermitian with non-negative real 
roots, there exists a unitary matrix W such that WHW°? = D, is diagonal 
with these non-negative real roots along the diagonal; and this W can be 
chosen in such a way that 


WH?W°? = WA*A*TW°T = D 4+ D 


so that D,? = D+ D and so D, = D, + Dz where the diagonal elements of 
D; are the positive square roots of the corresponding real roots of D. Then H 
must be of the same form as A* in (i) for if X = VW°’, then 


XWA*A*°TW°TXET = X(D + D)X°? = VA*A*OTVST = DL D 


so that X(D + D) = (D+ D)X and so X(D.+ D.) = (D2 + D:)X. From 
this, XWHW°TX*? = XD\X = D, = VHV" so that H = V°7TD,V and 
from the form of the matrices on the right side of this equality, their product 
is of type (i). 

From A* = HU, it follows that VA*V°? = VHV°TVUV" where the 
matrices have the form 


B, “I 7 hy 0 | kh “4 
re B,°} {0 D2 Us U,z)}. 
If A* is non-singular, Dz is non-singular and by equating corresponding 
block matrices, Us = U,° and U; = —U;°. 
If A* is singular (in which case there is some arbitrariness involved in the 
choice of U in HU), then D, is singular; let the first r diagonal elements be 


non-zero, the remaining being 0. From this it can be seen that D.(U; — U,°) 
= 0 and D.(U; + U2°) = 0; this means that the first r rows of U, are the 
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conjugates of the first r rows of U,; and the first r rows of U; are the negative 
conjugates of the first r rows of U;. Since VU V° is unitary, these 27 rows are 
linearly independent and by means of the », v*-basis procedure employed in §2 
above, it is seen that it is possible to complete the remaining rows of this 
matrix so that it is unitary and of the form (i). From the form of each matrix 
in the 2” X 2n matrix relation A* = HU, it follows that A can be expressed 
as required by the theorem. Since U°7A* = U°THU = K is hermitian, 
A* = UK holds (uniquely if A* is non-singular) and the theorem is true. 


5. Properties of normal quaternion matrices. If A is a normal quaternion 
matrix, it can be brought into diagonal form under a unitary similarity trans- 
formation (see (4), for example). Some further properties of normal quaternion 
matrices are verified here. 

It is known that a complex matrix A is normal if and only if A“? is a poly- 
nomial in A. If A is a normal quaternion matrix, there exists a unitary quater- 
nion matrix U such that UAU°? = D where the characteristic roots of A 
appear in the diagonal matrix D. If a, a2,..., a, are the distinct roots of A, 
the set of equations 


in x, always have solutions in the complex field. This implies that 
m—1 
D°? = > x,D’ 
j=0 
and, if x; = p,- e, 
m—1 10 m—1 
A®? = 9) psU(e *I) USA? = D) p5V A? 
j=0 j=0 


where V, = U(e*‘I)U°? is unitary and V,A = AV, for all j. If more latitude 
is allowed for the degree of the polynomial, let the distinct roots be written in 


the form a, a@2,...,@;,...,@m Where a,...,a, are the non-real complex 
roots. Let the roots a1, a2,...,@;,...-Qm,@1,...,@, be used to form the 
m + r equations 
ee m+r—1 
B; - » x 5B? 
= 


where 6, runs through the latter set of a; and &;; in this case the x, will all be 
real and it follows that: 


THEOREM 4. A quaternion matrix A is normal if and only if A°? is a poly- 
nomial in A with real coefficients. 


The following theorem will now be shown to hold as in the complex case: 


THEOREM 5. Two normal quaternion matrices A and B are commutative 
if and only tf they can be diagonalized by the same unitary transformation. 











MATRICES WITH QUATERNION ELEMENTS 199 


If AB = BA, let UAU°? = D where D is diagonal such that like roots are 
in consecutive order, with real roots a;,...,a, first, and complex roots 
Bi, **-* » Be (By = Yk + 15x, bx > 0) next. Let 


D 0 ond CG, —-C,° 

. C, C,° 
be the 2” X 2n complex matrices which are isomorphic to D and UBU®’, 
respectively. From the commutative property DC, = C,D and D°C, = C.D, 
and so 


C.2@ Cn+...4+¢ Cunt Cut... $+ Cra, 
C.= Cu +...+C2,,+0 +...+40, 


where D = ail, +... +a,J,+ 6:0’: +... + 8,1’, and where Ci, and C2, 
have the same order as the identity matrix J, and C’,, and the corresponding 
0 matrix in C, have the same order as the identity matrix J’, Therefore, 


UBUST = (Cyt... +CutCut...+C'%1) +7(Cat...+C2,+04+. ..+0) 
(Cur+jCa) +... +(Ciu+Ce) $F Cut...¢ Cu 


where the C’,, have only complex elements. Since UBU®? is normal, so is each 
matrix in the above direct sum; there exist, then, unitary quaternion matrices 
W, which diagonalize Cy + jC» and unitary complex matrices V, which 
diagonalize C’,, for all the above k. If V is the unitary matrix formed by 
taking the appropriate direct sum of these W, and V,, it follows that 
VUBU*TV*®? is diagonal and that VUAUS? V°? = VDV*T = D is also dia- 
gonal. The converse is immediate. 
The above generalizes as in the complex case: 


THEOREM 6. If {A,} is a set of normal quaternion matrices which commute 
in pairs, they can be diagonalized by the same unitary transformation. 


If each of the A, have a single characteristic root, a;, the theorem is true. 
If these roots are all real, the theorem is trivially true. If at least one root, say 
a, is non-real complex, let VA,V°? = aJ and VA,V°? = A’, for all other i; 
then each A’; commutes with a,J and so all A’, are normal, complex, and 
commutative in pairs, and can all be diagonalized by a complex unitary 
matrix U. Therefore, the unitary matrix UV diagonalizes all A ,. 

In general, the proof follows by induction on the order of the A ,. The theorem 
is trivially true for 1 X 1 matrices. Assume the theorem to be true for 
(n — 1) X (m — 1) matrices. It may also be assumed that there is at least one 
matrix, A ,, which has at least two distinct roots; let UA,U°" = D be diagonal 
(in the same form as D in the preceding theorem). Then each UA,U°? com- 
mutes with D, the problem is reduced to that involving matrices of order less 
than m and the theorem is true. 

The following theorems are true in the complex case (6); they are also true 
(obviously so from the isomorphism above) in the quaternion case: 











200 N. A. WIEGMANN 


THEOREM 7. A quaternion matrix A is normal if and only if its polar 
matrices commute. 


THEOREM 8. Jf A, B and AB are normal quaternion matrices, then BA is 
normal. 


THEOREM 9. If A and B are normal quaternion matrices, then AB is normal if 
and only if each of A and B commutes with the hermitian polar matrix of the other. 


6. A diagonal form under unitary equivalence transformations. It is 
also possible to bring a quaternion matrix into a real diagonal matrix under a 
unitary equivalence transformation according to the following: 


THEOREM 10. For every r X s quaternion matrix A there exist two unitary 
quaternion matrices U and V (of dimensions r Xr and s X s, respectively) 
such that UAV = D is diagonal with non-negative real roots along the diagonal. 


Let A = A; + jA2 where A; and A: are complex, as before, but r X s in 
dimension. Let C be the 2r X 2s matrix (composed of A; and Az) with complex 
elements which corresponds to A. According to a corollary due to Eckert and 
Young (2), if U is a 2r X 2r unitary matrix which diagonalizes CC°’, there 
exists a 2s X 2s unitary matrix V such that UCV = D, isa 2r X 2s diagonal 
matrix with non-negative real elements. From preceding work, this U may be 


taken as being in the form 
U; - <4 
U2 ors 


so that UCC°TUST = D, + Dy is 2r X 2r and so UCV = D+ D where D 
isr X s, where the elements are non-negative real, and where (D+D)(D+D)°? 
= D, + Dz. It remains to verify that V has the proper structure (i.e., like 
that of U). By considering the relation UC = (D + D)V°7, it follows (as 
in the proof of the polar representation above) that V has this form where some 
arbitrariness may be involved, as before, in choosing V. If the components of 
V are V; and V2, then (U; + 7U2)A(Vi+ V2) = D as required in the 
theorem. 
As in the complex case (8), it is also true that 


THEOREM 11. Jf A and B are twor X s quaternion matrices, then there exist 
two unitary quaternion matrices U and V such that UAV = D, and UBV =D; 


are complex diagonal matrices if and only if AB°? and B°TA are normal 
matrices. 


If such a U and V exist, the theorem is obviously true. 
If AB°? and B°7A are normal, but the preceding theorem U;A V; = D, is 
a non-negative real diagonal matrix and U,BV,; = C. Let 


_—D 0 fa. Be 
d,=|¢ 4 and c= [3 | 


ee a i ae 


} 
' 
y 
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where D is non-singular and B,; has the same order as D. From the given 
conditions D,C°? and C°7D, are normal; using the former, it follows that 
(B;D)(B;D)°? = 0 (where B; has quaternion elements and D is real) so that 
B,D = 0 and so B; = 0. Similarly B, = 0. Therefore DB,°7 and B,°7D are 
normal. Now the characteristic roots of DB,°7 and B,°7D are the same. (In 
the complex case, the characteristic roots of MN are the same as those of NM; 
from the isomorphism used above between m X m quaternion matrices and 
2n X 2n complex matrices, this result is seen to carry over). Therefore, from 
§5, there exists a polynomial f(x) with real coefficients such that B,D 
= f(DB,°") and DB, = f(B,°7D) and so DB, = f(B,°7D) = D-'*f(DB,°")D 
= D"'B,DD or D*B, = B,D*. Since D has positive diagonal elements, 
DB; = B,D. Since DB,°TB,D = B,D . DB,°", then B,°TB, = B,B,° and 
B, is a normal quaternion matrix which commutes with the (normal) real 
diagonal matrix D. There exists a quaternion unitary matrix W, which 
diagonalizes each simultaneously; there also exist unitary matrices W, and Ws; 
so that W.B,W; is a real diagonal matrix. By multiplying D; and C, each on 
the left and right, respectively, by the matrices 


W, O W,°? O 
0 We)’ 0 W;)’ 


the theorem follows. 
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A THEOREM CONCERNING THREE FIELDS 
I. N. HERSTEIN 


Several authors (1; 2; 3; 4; 5; 6) have recently studied the existence and 
non-existence of certain types of extensions of a given field. In this note we 
prove a theorem closely related to these results which, in a sense, contains 
essential portions of each of these. We prove the 


THEOREM. Let F C K CL be three fields (where we assume these inclusions 
all to be proper). Suppose that for every element x in L there exists a nontrivial 
polynomial f,(t) in the variable t with coefficients in F (and which depend on x) 
such that the element f,(x) is in K. Then either 

(a) L is purely inseparable over K, or 

(b) L, and so K, is algebraic over F. 

Proof. Suppose that L is not purely inseparable over K. Then there exists 
an element in L which is not in K which is separable over K. The set of all 
elements in L which are separable over K form a subfield L’ of L. K is of 
course contained in L’; by supposing that Z was not purely inseparable over K 
we have that L’ # K. If this subfield L’ were algebraic over F, then K would 
also be algebraic over F. This, combined with the fact that L is algebraic 
over K, would then lead to the desired conclusion that L is algebraic over F. 
So we suppose, to the contrary, that there is some element a € L’, a ¢K 
which is transcendental over F. (Being in L’, a is of course separable over K.) 
We shall show that this leads to a contradiction. 

Let Z = F(a), the set of all rational functions in a over the field F. Let 
K = LC\K. Consider the three fields F C K C L. These inclusions are all 
proper since a € L, a ¢ K, and since a is algebraic over K but not over F. 
Also, if x € Z then there is a polynomial f,(¢) with coefficients in F so that 
f(x) € K; since f,(x) € L it follows that f,(x) € K. Thus the conditions on 
the three fields F, K, L carry over to the three fields F, K, L. 

By Liiroth’s theorem K is a rational function field over Fin some s, K = F(s). 
L = R(a) is of finite degree and separable over K. Now Nagata, Nakayama 
and Tuzuku (5) have proved for this situation that there exist two distinct 
logarithmic valuations V; and V; on L which coincide on K; a simple modifi- 
cation of their argument yields that we can find such V; and V: which, in 
addition, are trivial on the field F. Thus for these two valuations we have 
the following properties: 

(1) There exists au € L,u¢K so that Vi(u) ¥ V2(u); 

(2) Vi(k) = V2(k) for all k € K; 

(3) Vila) = V2(a) = 0 for alla #0 € F. 


Received August 12, 1954. The author is greatly indebted to Professor Irving Kaplansky 
for his suggestions in the formulation in its present form of the theorem proved in this note. 
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Without loss of generality we may assume that V;(u) > 0. By hypothesis, 

k= u" + a_w"' + ...+au’ € K for some a, € F a, #0, n>r>1. 
Thus V,(k) = V2(k). 

Since V;(a,) = 0 (we only consider the non-zero multipliers that occur in 
the expression for k) and since a, ~ 0, Vi(a,u’) = rV;(u) < Vila, u™) = 
mV,(u) for m > r occurring in the expression for k with non-zero multiplier. 
Thus, since V; is a non-Archimedean valuation, it follows that V;(k) = rV,(u). 
Since 0 < V,(k) = V2(k), it follows that V;(u) > 0. Thus the argument 
used above for V; can be repeated and it follows that V2(k) = rV2(u). But 
Vi(Rk) = V2(k); therefore we are led to rVi(u) = rV2(u), which, since r #0 
implies that V;(u) = V2(u). This is contrary to the assumption that V,(u) # 
V2(u). The theorem is thereby established. 
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ON ALGEBRAIC SURFACES TERMWISE INVARIANT 
UNDER CYCLIC COLLINEATIONS 


J. C. MORELOCK anp N. C. PERRY 


1. Introduction. In algebraic geometry it is of interest to examine poly- 
nomial surfaces F which transform into themselves under the collineation T 


defined by: 
(x's, x», x's, x" 4) = (x1, Exs, E*x:s, E*x,4) 


where E”? = 1, and ? is a prime (2). One of the most obvious ways to ensure 
invariance of a surface is for each term x;°x2"x3°x,* of F to go into itself. We 
present initially, therefore, a theorem which will be useful in the study of such 
termwise invariance for polynomials of composite degree. Specializations of 


this result are then developed for the case when the polynomial degree is a 
prime. 


2. Polynomial surfaces of composite degree. Assume that each term 
%1°x2"x3°x4* of the polynomial is invariant and of degree mp. Then since the 
transform of this term under T is x;°x."x;°x,4(E°E**E™) there must hold the 
simultaneous equations b + 2c + 3d = kp and a+b+c+d= mp. Dio- 
phantine solutions in terms of two parameters m and r are readily developed as 


a=n, b=r—2n+ (Qm—k)p, c=n—2r+(k—m)p, d 


r. 


(n), 


Hence, in terms of congruence classes with respect to the modulus ?, a 
bE (r — 2n), cE (m — 2r), dE (r). 

On the other hand let a, 5, c and d (which aresuch thata + 6 + c + d = mp) 
be assumed to have membership in the classes (nm), (r — 2m), (m — 2r), (r). 


Then the term x;°x2x;°x4* transforms under T into x;*x.°E°x;°E**x,E™. 
Since 


fan) 


b + 2c + 3d = (r — 2m) — (2m — 4r) + (37) = 0 (mod p), 
and E” = 1, invariance is established. Thus we have 


THEOREM 1. A polynomial term x,°x2x;°x,*, which has degree a multiple of 
a prime, p, will go into itself under T if and only if a,b, c,d are respectively 
members of the congruence classes (mod p): (n), (r — 2n), (n — 2r), (r). 


3. Polynomial surfaces of prime degree. Of particular interest in algebraic 
geometry are surfaces where the degree of the defining polynomial is not a 
composite integer but is exactly equal to a prime p > 3. For this case m and r 
are subject to certain restrictions which we now study by setting m = 1 in 
the solution to the simultaneous equations used to develop Theorem 1. 
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Since a, b, c, d are non-negative exponents where each is at most equal to p, 
it is clear on inspecting the equation 6 + 2c + 3d = kp, that k can take on 
only the values 0, 1, 2, and 3. 

Ifk=0,6=c=r=0,andn = p. 

If k = 1, we have the four inequalities: 


(1) O<n, 

(2) O<r—2n+p, 
(3) O< m — 2r, 

(4) O<r. 


From inequality (3) we have 2r < n, and from (2) it follows that 2n < r + , 
orn < $(p + r). Thus must lie within the range defined by 2r < » < 4(p+r). 

Since 4r < 2n we may replace 2n by 4r in the inequality (2) obtaining 3r < p, 
or r < $p. Hence, to obtain an invariant term of degree p, necessary restric- 
tions on r and m are (for the case k = 1) that r first be chosen on the range 
0 <r < 4), and that then m be chosen on the range 27 < n < 3(p + 1). 

A similar analysis for the case when k = 2 demonstrates that r be chosen 
on the range 0 < r < 3p, and then m be chosen on the range 2r — p <n < }r. 
An inspection of the latter inequality shows that m is non-negative only if 
r > 4(p + 1). This suggests splitting the range of r obtained into the two 
ranges 


O<r<} and 3(6+1) <r< }p, 


with corresponding ranges for n of 
O<cn<tr and 2r7-—pcnc kh. 


Finally, then, if k = 3, b + 2c + 3d = p, hencer = pandn =b=c = 0. 
To summarize, restrictions on m and r necessary for the degree of the poly- 
nomial to be precisely p are: 


(k=0) (i) r=Q0, n=); 
(kR=1) (i) Or <9; 2 cn<¢h(ot+n); 
(k=2) (iia) OS r <4; 0<n < fr; 

(iiib) 4(6+1) <r < 9p; 2F —pen< }; 
(k=3) (iv) r=p,n=0. 


We must now demonstrate the sufficiency of the above inequalities, i.e., 
that any m and r so chosen will yield a polynomial term of degree p. We illus- 
trate the method of proof for (ii), for the case where p is of the form 6a + 1 
and K is any even member of the range 0 < r < 4). L is any corresponding 
member of the range 2r < n < $(p +r). Then 


r=%3(p-—3K—-—1), 0<K<¢}(o-1) 
and 


n=3(p+r—2L—1), 0<L< ik. 
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The exponent 6 = r — 2n + > is equal to 2L + 1 upon substitution and sim- 
plification, and in the same way the exponent c = m — 2r can be shown to 
equal (3/2)K — L. Then the polynomial term corresponding to the particular 
choice of K and L will be 


( —2Z—1).. 2L4+1_ (3/2)K—L 3K-1 
xtor" gg et, en 


The degree of this term is p, as can be readily shown by adding the exponents. 

In a like manner the sufficiency of (ii) for p = 6a + 1 and K odd can be 
readily established. A similar treatment will establish the inequalities (iiia) 
and (iiib) for all possible cases where p is either of the form 6a + 1 or 6a — 1 
and K is either even or odd. Since conditions (i) and (iv) are obvious, sufficiency 
is completely demonstrated. We summarize these results in the following: 


THEOREM 2. Necessary and sufficient conditions that a polynomial term 
%1°X_"x3°x4* of prime degree p > 3 shall be invariant under T are that: 


ace (m), bE (r —2n), cE (m—2r), de (r), 
and also n and r must satisfy the conditional inequalities: 


(a) r=0, n=); 

(b) O<r< hp, 2<cn<i(ptn); 

(1) Org }p,0O<n < fr; 

(coc) 36 +1) <r < hp, 2? -—pen< hr; 
(d) r=p,n=0. 


4. A numerical example. Theorem 2 makes possible the selection with 
certainty of the largest possible number of terms of a polynomial of degree p 
which is termwise invariant under the collineation 7. For example, by the 
laborious process of writing out all terms of a given degree and testing them 
individually for invariance it is a known result (1) that the largest such 
polynomial for degree 5 is 


Fe = x15 + x05 + xyxo*xs + xX 9x3? + x12 K 22x 4 + Xi gx, 
H+ x35 + ox g8xg + K22x gg? + yx s2x 4? + Xixexg + x45. 


Application of Theorem 2 leads to 12 pairs of values for r and m as follows: 


(a) r = 0, s = 5; 
(b) r = 0, n=0,1,2; 
r= 1, 2 = 2,3; 

(c) r = 0, n= (0; 
y= il, n = 0; 

y= 2, n=0(0,1; 
y = 3, m= 1; 

(d) r = 5, n = 0. 


The application of these pairs leads precisely to the polynomial F;. 
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5. The number of invariant terms. In conclusion we establish 


THEOREM 3. For a prime p > 3 the number of invariant terms is equal to 


$(p? + 6p + 17). 


Proof. It must first be indicated that all pairs (n, r) selected by Theorem 2 
lead to different polynomial terms. We can exclude from further consideration 
the pair (0, 0) since its appearance in Theorem 2 (b and c;) produces the poly- 
nomials x,”, and x3”. An examination of the controlling inequalities indicates 
that all other pairs (m,7r) are distinct since (b) and (c;) differ in m, (b) and 
(ce) differ in r, as do (c;) and (cz). Then since m is the exponent of x, and r is 
the exponent of x4, the distinct pairs (, r) produce distinct polynomial terms. 

To count the number of terms in (b) (Theorem 2) for p = 6a + 1 we enu- 
merate the pairs (K, L) where 0 < K < 34(p — 1) and O < L < $(38K + 1) 
as in the accompanying table. 





K 0 | 1 2 | 3 ++] 4(p — 1) 
| 

| Zz 

ise 


L| 0 | 0, 1,2 0,1,2,3| 0,1,2,3,4,5 











| 0,..-,4(2 — 1) 





The number of terms is thus 


[1+34+44+64+74+...44-1) +404) 
=[1L+44+7+... +4041] + 34+64+94+...+4(-—))]. 


By the ordinary formulas for an arithmetic series the number of terms is equal 
to (p? + 6p + 5)/12. 

By a similar analysis, the inequality (c,) contributes (p? + 6p + 5)/16 
terms if a is odd, and (p? + 6p + 9)/16 terms if a is even. In a like manner 
(ce) contributes (p? + 6p + 5)/48 if a is odd and (p? + 6p — 7)/48 terms if 
a is even. In either case the number of terms contributed by both (c;) and 


(ce) is (p? + 6p + 5)/12 since 


(p? + 6p + 5)/16 + (p? + 6p + 5)/48 = (p? + 6p + 5)/12, 
(p? + 6p + 9)/16 + (p? + 6p — 7)/48 = (p? + 6p + 5)/12. 
The total number of terms thus obtained which are invariant under T 


(when p = 6a + 1) is thus shown to be $(p? + 6p + 17). A similar analysis 
for p = 6a — 1 can be shown to yield the same result. 
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A THEOREM ON FIXED POINTS OF INVOLUTIONS IN 5s: 
DEANE MONTGOMERY ann HANS SAMELSON 


1. Introduction. Let T be an orientation-preserving homeomorphism of 
period two of the 3-sphere S* onto itself; further let T be different from the 
identity and have at least one fixed point. It has been shown by Smith (8, 
p. 162) that the set F of all fixed points of T is a simple closed curve. However, 
very little is known about the position of F in S*. There is an example (4), 
a slight variation of an example given by Bing, which shows that F may be 
wildly imbedded. Even if we assume F to be tame, as we shall do in this paper, 
it is not known whether T is equivalent to an orthogonal transformation, or 
even whether F is necessarily unknotted. Our purpose is to show, assuming T 
semi-linear, that at any rate F cannot belong to a certain class of knots in- 
cluding the ordinary cloverleaf. 

We consider S* as the unit sphere 

1 

D« 

1 
in Euclidean 4-space E*‘. ‘‘Polyhedral’”’ will always be understood in the sense 
of spherical geometry; all simplicial decompositions used are obtained by 
dividing S* into simplices by means of planes through the origin of E‘, that is, 
into spherical simplices; this is equivalent to the decompositions considered 
in (5), where S* is considered as boundary of a Euclidean 4-simplex. The 
orientation-preserving homeomorphism TJ of period two is assumed to be 
semi-linear, which by definition means that 7 is (spherically-) affine on the 
simplices of a certain simplicial decomposition = of S*; because of the periodi- 
city of T we may assume that in addition T leaves the decomposition 2 
invariant. 

The set F of fixed points of T is then a simple closed (spherical) polygon 
(8); by going to a subdivision, if necessary, F may be assumed to be a sub- 
complex of 2, that is, F is made up of one-cells of 2. 

Let C, and C, be two simple closed polygons in S*, disjoint from each other; 
C, is called a parallel knot, of order 2, of C2, if there exists a polyhedral Mébius 
band M in S* such that C, is the boundary of M, and such that C, is what we 
shall call a middle line of M, i.e., such that C, represents a generator of the 
fundamental group 7:(M) of M. (By Mébius band we mean the customary 
figure obtained by identifying two opposite sides of a rectangle, with a twist.) 
The theorem we propose to prove is as follows: 


aw te 


= 1 
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THEOREM I. Suppose the fixed point curve F of the semi-linear involution T 
of S* is a parallel knot, of order 2, of the simple closed polygon C. Then the knot 
group of C, i.e., the fundamental group x,(S* — C) of the complement of C, is 
infinite cyclic, and the linking number of F and C is +1. 


A standard torus W in S* is the figure obtained by first constructing a 
standard torus in E* (a circle D in a plane E* C E? is revolved around a line 
E' C E* which does not meet D) and then projecting E* stereographically 
onto S*. The surface W is parametrized by using an angular coordinate r; 
on D and an angular coordinate r, for the rotation around E'. If p,q are 
relatively prime natural numbers then the curve in S* given by 

r, = 2rpt, rr. = 2rqt, O<ti<¢l 
is called the standard torus knot (p,q). A simplicial standard torus knot 
(p, g) is defined to be a sufficiently close polygonal approximation (obtained 
by selecting valuesO = fh) <4; <... < t, = 1 with max |t, — ¢,_,| sufficiently 
small in an obvious sense and replacing the segment corresponding to [t,—:, ¢,] 
of the original torus knot by the spherical segment connecting the end points). 


CorOLuarRY. If the fixed point set F of T is a simplicial standard torus knot 
(p, 2), then p = 1, which implies that F is not knotted. 


This follows from Theorem I since the torus knot (p, 2) is a parallel knot, of 
order 2, of the center line of the torus. It is necessary to observe that all 
requirements are satisfied simplicially but this is not very difficult to show. Note 
that the torus knot (3, 2) is the cloverleaf. 

The method of proof of Theorem I has as a by-product the following 
Theorem which (we have learned) has also been proved by E. E. Moise: 


THEOREM II. Jf F is unknotted in the sense of bounding a polyhedral 2-cell, 
then T is equivalent to a rotation of S*. 


In the proofs we make use of a theorem of Alexander (1) which states that 
a polyhedral 2-sphere S? in S* separates S* into two domains A, A’ such that 
AU Sand A’ U S’ are closed 3-cells (each with boundary 5S). We also use a 
second theorem by Alexander (1) which states that a polyhedral torus B in S* 
separates S* into two domains D, D’ such that at least one of B\U D, B\ D’ 
is a solid torus, that is, homeomorphic with the product of a closed disk and a 
circumference, with B of course corresponding to the boundary. Actually 
Alexander’s proof needs a small modification to become applicable to the 
spherical polyhedra used here. However Schubert (6) has given a proof 
which is directly applicable to our case. 


2. First simplification of M /\ T(M). We now begin the proof of Theorem I 
and recall that F is boundary of a Mébius band M with middle line C. We 
can assume that M and C are subcomplexes of the 7-invariant simplicial 
decomposition = of S*. Then T(M) is another simplicial Mébius band and 
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T(M) also has F as boundary, since F is pointwise fixed. The set M (\ T(M) 
includes F but must include further points, for otherwise M\W T(M) would 
be a Klein bottle imbedded in S* without singularity and this is impossible. 

We shall show how M may be modified so that M (\ T(M) is made simpler. 
In the process M remains polyhedral, F remains the boundary, and C is not 
changed essentially. More specifically, all the Mébius bands considered in the 
modification process will satisfy the following condition. 


CONDITION (m): There exists a semi-linear homeomorphism of S*, which 
sends some middle line of the band into the curve C (of Theorem I) and leaves F 
pointwise fixed. (It is immaterial which middle line is selected because if C, 
and C; are two polygonal middle lines of M there is a semi-linear homeomor- 
phism of S* onto itself which maps M onto itself, leaves F pointwise fixed, and 
takes C, to C:. This can be shown by methods indicated in (6) and in §4. 
This fact is implicitly involved in a number of our arguments.) 

In the process, subdivisions of = may be needed; they will be introduced 
without explicit mention. The ultimate aim of the modifications is to prove 
the following: 


LemMA A. Under the hypothesis of Theorem I, there exists a simplicial 
Mobius band M, with boundary F such that 


M,(\T(M;) = FUC, 
where C, is a middle line of M, and where condition (m) is satisfied for C,. 


The proof is in several parts. The remainder of this section and the next 
section are devoted to selecting M so that M and T(M) are in general position. 


LemMMA 1. Let V be any neighborhood of F. Then there exists a simplicial 
Mobius band M’ such that: 

(1) the boundary of M’ is F; 

(2) MC\ (S* — V) = M’C\ (S* — V), that is, M and M’ coincide outside 
of V; 

(3) there is a neighborhood V' of F such that 

M'(\T(M’) -FCS*— V’, 

that is, the intersection, except for F, is outside of V'. Note that in consequence of 
(2) and with V small, a middle line of M is also one of M’, so that condition (m) 
is satisfied. 


Proof. By replacing 2, if necessary, by a subdivision we can assume the 
following: 

(a) F is a normal subcomplex, meaning that every simplex of 2, whose 
vertices belong to F, itself belongs to F; 

(b) the barycentric star St (F) is contained inside V; 

(c) St(F) is a simplicially divided solid torus whose boundary is a torus 
surface Q. 








7 


le 


1S 








——— Ee 


FIXED POINTS OF INVOLUTIONS IN S® 211 


As a matter of fact, St(F) is the union of the 3-cells, corresponding to the 
vertices of = on F in the dual subdivision =* of S*. Two such 3-cells, correspond- 
ing to neighboring vertices of F, have in common a 2-cell, the cell dual to the 
1-cell of = joining the two vertices, whereas two such 3-cells that correspond 
to non-neighboring vertices on F are disjoint. We may also assume that M 
is a normal subcomplex of ~. It is true that M (\ St(F) is the barycentric star 
of F on M. It follows that M ()\ Q is a simple closed curve which we shall call 
D, and that D and F form the boundary of the annulus M ()\ St(F); an annulus 
is the topological product of an interval J and a circle S'. As a consequence D 
is not ~0 on Q. 

We now propose to construct a polygonal simple closed curve D’ on Q such 
that 

(1) DD ~DonQ 

(2) T(D')/ OD’ =0 (0 = empty set) 

(3) D’ and F bound an annulus A in St(F) for which A (\ T(A) = F and 
A\Q = D’. For the construction of D’ we note that for each of the 3-cells 
o3, dual to the vertices of F, the boundary 2-sphere Bd ¢; intersects Q in an 
annulus. From the known behavior of periodic transformations of 2-cell and 
2-sphere (see (2) and the references therein to Brouwer and Kérekjarto) it 
follows that 7 on such an annulus Bd o;/)\Q is equivalent to the standard 
transformation of J X S', obtained by rotating S' through 180°. It is now 
easy to construct D’ as union of polygonal arcs, one arc in each annulus 
Bd o; (\ Q, connecting the two components of the boundary of the annulus. 
The existence of A follows then from the fact that each @; is the join of Bd a, 
and the vertex of F, to which it is dual. We now apply Theorem 3, p. 180 (see 
also p. 161), of Schubert (6); according to this there exists a semi-linear 
homeomorphism ¢ of S* which is the identity outside of V, maps Q into itself 
and sends D into D’. 

We now obtain the desired Mébius band M’ as the union of A and the image 
under ¢ of the part of M in S* — St(F). The requirements of Lemma 1 are 
now satisfied with V’ the interior of St(F). This completes the proof. 

We now call the M’ thus obtained again M and proceed with further modi- 
fications leading towards proving Lemma A. 


3. Reduction to a finite set of simple closed curves. Suppose that 
M (\ T(M) contains a 2-cell ¢ of 2 with vertices a, b, c; note that ¢ is disjoint 
from F by Lemma 1. Then o and T(c) have no point in common since other- 
wise there would be a fixed point of T in ¢ \U T(¢). Now a is a face of a 3-cell 
rt of = with vertices a, b, c, d. Let e be a point inside + and subdivide r into 
the join e o Bd r of e and the boundary Bd r of r. Because of the normality of 
M the intersection + ()\ F is empty; it follows again that r and T(r) are disjoint 
since otherwise there would be a fixed point of T in r\U T(r). We subdivide 
T(r) by the join T(e) o T(Bd 1); the new subdivision of S* is T-invariant. 

We now replace the cell ¢ of M by the join of e and Bd g; similarly replace 
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T(e) in T(M) by the join of T(e) and 7(Bdc). The band M’ so obtained has 
boundary F and any alteration of C has been in accord with condition (m). 
However 


M' (\ T(M’) = MC\T(M) -— [interior ¢ LU interior T(c)]; 


i.e., the interiors of the 2-cells ¢ and T(c) no longer belong to the intersection of 
M and T(M), whereas no other intersections are introduced. By a repetition 
of this process we arrive at a Mébius band M’ such that M’ (\ T(M’) contains 
no 2-cell. 

Given a 2-cell c, as in the above, we are free to make the indicated modi- 
fication on either side of c. Further if we have in the intersection a union of 
2-cells ¢; of 2 which forms a 2-cell K or an annulus R such that 


T(K) (\K = Oresp. T(R) OR = 0, 


then K or R has two well-defined “‘sides’’ and we may modify each oe, to either 
of the two sides. 

Next suppose that M (\ T(M) contains a 1-cell » = (a, 6) of = not on F, 
and that M and T(M) do not cross each other at 7 in an obvioussense. In M 
the 1-cell » lies on two 2-cells, say (a, b,c) and (a, b,¢@’ There exists a set of 
vertices dp = c,d,,...,d, = d, 1 << k such (6 hd id2),... (abdy_1d) 
are 3-cells of « and such that d;, 0 < i < k does nur to T(M). Using a 
subdivision of these cells and their images under T we may modify M to become 
M’ in accordance with condition (m) and so that 


M’' (\ T(M’) = MC\T(M) — [interior 7 U interior T(n)]. 


We omit the details of this process of pulling M and T7(M) apart. Using this 
method we obtain a band, again called M, such that M and T(M) cross along 
any l-cell in M(\ T(M) — F. 

Similar methods make it possible to obtain a Mébius band, again called M 
such that M/\ 7(M) contains no isolated points (and of course no 2-cells, 
and no 1-cells along which M and T(M) do not cross). We note in particular: 
Any component of the intersection of M and T(M) that is a 2-cell, disjoint 
from its image, can be completely removed from the intersection (by modifying 
M in accordance with condition (m)) without introducing new intersections. 
Any component of M (\ T(M) that is an annulus, disjoint from its image and 
along which M and T(M) do not cross in the obvious sense, can also be removed 
in this way. If M and T(M) do cross along such an annulus, then M can be so 
modified that in the intersection with 7(M) there appears instead of the 
annulus any preassigned simple closed polygon that represents a generator 
of the fundamental group of the annulus, e.g., either one of the boundary 
curves; the modified M and T(M) cross each other along this curve. Any 
component of the intersection that is a simple closed curve along which M 
and T(M) do not cross can be removed. 

Finally let p be a point of M, necessarily a vertex of 2, such that the order 
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of the graph M (\ T(M) at ? is greater than 2, that is, that more than two 
1-cells of M (\ T(M) end at p. Let R be the barycentric star of p, and let E; 
and E, be the intersections of M and of T(M) with the boundary 2-sphere 
Bd R of R. Clearly E, and E; are simple closed polygonal curves which cross 
each other at their points of intersection. 

The curve E; separates Bd R into two 2-cells; let K be one of these. If we 
replace R(/\ M by K and make the corresponding replacement at 7(p) on 
T(M), then on the modified M the number of points of M (\ T(M) of order 
greater than two has been reduced whereas all other properties are retained. 
We state the main result of this section as a lemma. 


LemMMA 2. The original Mébius band can be modified to a new band, again 
called M, satisfying condition (m) and such that the intersection M (\ T(M) 
consists of a finite number of simple closed curves, along which M and T(M) 
cross each other. 


4. Curves on a M@dbius band. For further simplifications we shall need 
information about the simple closed curves on a Mébius band, which we 
formulate as a lemma. 


Lemma 3. Let M be any Mobius band. Any simple closed curve C on M 
belongs to exactly one of the following three, topologically invariant, categories: 

(1) the curves which are contractible to a point; 

(2) the curves which represent a generator of the fundamental group x,(M) of 
M; we call these middle lines of M; 

(3) the curves which represent the square of a generator of 7,(M); we call these 
edge-like (the boundary of M is in this category). 

A middie line, disjoint from the boundary of M, has arbitrarily small neighbor- 
hoods on M that are Mobius bands. An edge-like curve, disjoint from the boundary 
of M, separates M into a Mébius band and an annulus; it has arbitrarily small 
neighborhoods on M homeomorphic to an annulus. 

Any two middle lines intersect. Any two disjoint edge-like curves bound an 
annulus contained in M; a middle line, disjoint from the two edge-like curves, 
does not meet the annulus bounded by them. 


We omit the proofs of these statements and only note the following: The 
Mobius band has as an orientable double covering an annulus R, represented 
by the region 1 < x? + y? < 2in an (x, y)-plane; the Mébius band is obtained 
by considering the involutory transformation a of R onto itself, given, in polar 
coordinates, by 

a(r,0) = (8 —7r,0+ 72), 


and identifying pairs of points, which correspond under a, to single points. The 
identification amounts to a map H from R to the Mébius band, which is the 
covering map. Proofs for the statements above on curves on M can be made 
by considering the inverse images under H. 
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5. Elimination of bounding curves. Using the facts of §4 on curves on a 
Mobius band we can now make further simplifications. 


LemMMA 4. The band M can be modified (consistent with condition (m)) so 
that no component of M (\ T(M) bounds a 2-cell on M or on T(M). 


For the proof we start with the Mébius band M obtained in Lemma 2. 
Let C;,...,C, be those simple closed curves in M (\ T(M), which bound 
2-cells in M. We can assume that C; is such that the 2-cell K,, bounded by it 
on M, contains no point of M (\ T(M) in its interior. We show first that C, 
also bounds a 2-cell on T7(M). If not, then it is edge-like or a middle line (in 
the sense of §4) on T(M). 

If it is edge-like on T(M), then it bounds a Mébius band M’ on T(M). The 
set M’\U K, would then be a projective plane imbedded in S*, which is well 
known to be impossible. 

If it is a middle line, then one gets a contradiction by considering the inter- 
section curves of K, and T7(M) with the boundary torus of the barycentric 
star of C,. 

We now show how to remove C, from M (\ T(M), without introducing new 
intersections (and of course still satisfying condition (m)). We show first 
that C, cannot meet 7(C;). If it did, then the two would be identical, since T 
permutes the components of M/\7T(M). Then K,\ T(K,) would be a 2- 
sphere, invariant under T. Since this 2-sphere does not meet F, the two 3-cells, 
into which S* is divided by it according to Alexander’s theorem (1), would 
have to be invariant, and would therefore contain fixed points of T in their 
interiors. But this contradicts the fact that F is connected. We have therefore 
C, (\ T(C,) = 0. As shown before, C; bounds a 2-cell K’; in T(M). We con- 
sider now two cases, 

(I): Kif\T(K’;) =0, 
(Il): KifC\ T(K’;) #0. 
We start with case (I). 
As a preliminary modification we define a set M* by 


5.1 M* = (M — T(K’;))U T(RK)). 
We have of course 
5.11 T(M*) = (T(M) — K"';)U Ki. 


M* is again a Mobius band, since we have replaced the cell 7(K’,) by the cell 
T(K;,), which has no point of M in its interior; note that 7(K,) and T(K’;) 
have the same boundary curve. Condition (m) concerning the middle line is 
still satisfied. The intersection of M* and its transform is given by 
5.2 M* (\ T(M*) 
= ((M — T(K’;)) \ (TM — K’))) U (M — T(K’) OR) 
U (T(M) — K's) O\ T(Ky)) U (Ki ON T(K))). 
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Since K, (\ T(K’,) = 0, the second term on the right is just K,; similarly 
the third term is T7(K,). As a consequence, we have 


5.3 Ki VU T(Ki) C M* O\ T(M*) C (MO T(M)) U KU T(RK)). 


The two 2-cells K, and T(K;) are disjoint; this follows from the fact that C,; 
and 7(C,) are disjoint, and that the interior of K, does not meet T(M) at 
all. By the method described in §3, we can therefore remove K, and 7(K;) 
from the intersection of M* and T(M*) without introducing new intersections, 
and arrive at a new Mdbius band M,, such that 


Mi (\ T(M,) C MM T(M) — (C,U T(C))), 


i.e., the intersection of M, and 7(M,) is contained in that of M and T(M), 
but does not contain the curves C; and 7(C,) any more; this finishes case (1). 

In case (II) we define M*, and T(M*) as in case (I) by 5.1 and 5.11. For 
the intersection M* (\ T(M*) we have again formula 5.2. It is again true 
that K, (\ T(K,) = 0. Further we now actually have K,; C T(K’;), since the 
boundary curve 7(C;) of T(K’;) is disjoint from C;, as shown above, and 
therefore also from K,. Similarly T(K,) C K’. It follows then from 5.2 that 


5.4 M* (\ T(M*) = (M — T(K"';)) OA (T(M) — K’‘)), 
and so 
5.5 M* (\ T(M*) C MM T(M) — (Ci, U T(C))) 


(note that C, is the boundary curve of K’;, and T(C;) that of 7 (K’;)), so that 
C, and T(C;) are no longer in the intersection. This finishes case (II); con- 
dition (m) is still satisfied. 

Iteration of this process must come to an end, since at each application the 
number of components of M (\ T(M) decreases, and Lemma 4 follows. 


6. Reduction to a middle line. We now take M to satisfy the condition of 
Lemma 4, and proceed with a last reduction, in order to arrive at Lemma A. 
The intersection M (\ T(M) now consists of F and a number of curves, say 
Ci,..., Cm, which are either edge-like or middle lines on M, with at most one 
in the last category (because of Lemma 3). Suppose there are actually edge- 
like curves in this collection. One concludes, with the help of Lemma 3, that 
there is one of these, which we can assume to be C;, such that the annulus R 
bounded on M by F and C, contains no point of M (\ T(M) in its interior. 
We show that C; cannot be invariant under 7, or, which amounts to the same, 
that C; (\ T(C,) = 0. If C; = T(C,), then RU T(R) would be a T-invariant 
torus, which separates S* into two domains. From the action of T near F it 
follows that T interchanges the two domains. On the other hand, from the 
action of T near C; we see that T cannot interchange the two domains. This 
contradiction establishes our assertion. 

The curve 7(C;) is therefore one of C2, ..., C,; we claim that it is edge-like 
on M. If it were a middle line, we would get a contradiction by considering 
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the intersection curves of M and 7 (M) with the boundary of its barycentric 
neighborhood, as in the reasoning of §5; recall that T(C,) is edge-like on T(M). 
In particular, the number of edge-like curves among C;,..., C, is >1. 

With the help of Lemma 3 we see that there is a curve in M/\T(M), 
edge-like on M and on T(M), which we can call C2, such that the interior of 
the annulus R; bounded by C; and C, on M does not meet 7(M). Let R’; be 
the annulus bounded by C,; and C, on 7(M); the interior of R’; may meet M. 
We have two possibilities: 

(A) C,; = T(C,), 
(B) C, # T(C,). 


In case (A) we have C; = T(C:2), R’; = T(R:), Ri = T(R’:). We define 
then 
M* = (M — R;)U T(R)), 
so that 
T(M*) = (T(M) — T(R))) U Ri. 


M* is again a Mébius band. The intersection M* (\ T(M*) is identical with 
M (\ T(M); but M* and T(M*) do not cross along C,; and C2, so that C; and 
C; can be removed from the intersection by the methods of §4; condition (m) 
is kept intact. 
In case (B) we have to distinguish two subcases: 

(I) Ri T(R’')) = 0, 

(1) Ri OT(R’) #0. 
We begin with (I). Then R; and 7(R’;) are non-intersecting annuli on M. We 
define new sets by 
6.1 M* 


6.11 T(M*) 


(M — T(R’s)U T(R)), 
(T(M) — R's) U Ri. 


Then M* is again a Mébius band; an annulus between two edge-like curves 
has been replaced by an annulus; this has no effect on the middle lines, so that 
condition (m) is satisfied. For the intersection with 7(M*) we have the 
formula 
6.2 M* (\ T(M*) 

= (M — T(R'1)) O (TY) — R'D) U (M — T(R')) OR: 

U (TIM) — RD) OT(RY UR OT(R). 

Because of (1) the second and third term are just R; and T(R;). It follows that 
6.3 M* (\ T(M*) = (M — T(R’'s:)) O (TCM) — BR’) U 21 U T(R)). 


As in §5 we see that R; and T(R;) are disjoint: the T-image of the boundary 
of R; does not meet R, since we have case (I), and the interior of R; does 
not meet 7(M) at all. We are therefore in the situation of §3: R; and T(R;) are 
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disjoint components of M* (\ T(M*). We can therefore replace M* by another 
permissible Mébius band such that either R; and 7(R;) are absent from the 
intersection with the 7-transform (this if M* and 7(M*) do not cross along 
R,) or R; is replaced by C2, and T(R;) by T(C.) (this if M* and T(M*) do 
cross along R,). In either case C; and 7(C,) have been removed from the 
intersection. 

We come to case (II). T(R’:) is an annulus on M, bounded by 7(C;) and 
T(C.). Because of our assumptions 7(C;) is different from C, and C:. We show 
first that we cannot have C, = T(C.). We assume then for the moment that 
C; is invariant. Let R, be the annulus bounded by F and C, on M; its boundary 
is then invariant. If the interiors of R, and 7(R:) meet, this can happen only 
along C,, so that then C; would be 7-invariant; but this we have shown to be 
impossible. It follows that R.\V T(R:) is a T-invariant torus. The reasoning 
applied to R UV T(R) in the beginning of this section applies here too, and shows 
that C, is different from 7(C,). It follows that the boundary of 7(R’;) is 
disjoint from R;. Since R, is connected, it would have to be contained in the 
interior of 7(R’,). But this is impossible, since no component of M (\ T(M) 
lies in the annulus R between F and C,; on M, and so case (II) cannot occur at 
all. 

The processes described in this section can be applied as long as there are 
edge-like curves, different from F, in the intersection of the band and its 
transform. By a finite number of steps we arrive therefore at a Mébius band 
M, satisfying Lemma A; as noted earlier, the set M; (\ T(M,) — F cannot 
be empty, and on the other hand, two middle lines on a Mébius band are 
never disjoint, by Lemma 3, so that exactly one middle line will be left over 
in the intersection (besides F). 


7. The proof of Theorem I. The knot groups of the original curve C 
and of the curve C; of Lemma A are isomorphic, and the linking numbers of F 
with C and C, are equal, because of condition (m). It is therefore sufficient to 
prove Theorem I for C; instead of C. We return to the notation M for the 
Mobius band, obtained in Lemma A, and C for the single (invariant) curve 
constituting M\ T(M) — F. We denote the polyhedron M\U T(M) by S; 
it is clear that T(.S) = S. The second homology group H;2(S) is infinite cyclic 
and the generating 2-cycle contains all 2-cells, properly oriented, with co- 
efficient +1. (In fact S can be described as obtained from a torus W; = S'S! 
by identifying pairs of antipodal points on some generating circle p X S'.) 

It follows from Alexander's duality theorem that S* — S has two compon- 
ents which we call A; and A». We have Az = T(A;), Ai = T(Az2). The reason 
is that 7, at any point of F, at the same time reverses the orientation of S 
and preserves the orientation of the 3-sphere; it interchanges therefore the 
two domains into which S separates the 3-sphere locally (and globally). It is 
also true that CIA,, the closure of A,;, is A;,\U S, and that CIA; ()\ ClA, = S. 

Let N be the closed barycentric neighborhood of the middle curve C with 
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respect to the triangulation of S* reached after the various modifications. Then 
N is a solid torus and its boundary BdN = B is a torus. 

One shows with elementary deformations that the three sets S*' — N, the 
closure of S* — N, and S* — C have isomorphic fundamental groups, and that 
in fact the inclusion maps induce these isomorphisms. The intersection 


P=N(\M 


is the closed barycentric neighborhood of C on M. It is a Mébius band with 
C as middle line; similar remarks apply to 


Q = NAT(M). 


It can be shown by elementary constructions that the following facts hold: 

The set N — (PU Q) consists of two connected sets N,, Nz which are the 
intersections of N with A, As. 

The boundary curves BdP, BdQ of P and Q, which lie on B, separate B 
into sets B,, B; each homeomorphic with an open annulus, and B; = B()\ Aj; 
the closure CIB, of B,;is B;\/ BdP U Bd@. The generators of the fundamental 
group of CIB, represented by BdP or BdQ are homotopic in N to the square 
of the generator of the fundamental group of N which is in turn represented 
by C. All sets N, CLN,, P, Q are deformation retractable onto C; also CIA, isa 
deformation retract of ClA,\U N. Note that N and B are invariant under T 
and that P and Q, BdP and BdQ, N; and Ne, B,; and Bz are interchanged by T. 

Finally let 

X = Ci(S — (PU Q)). 


This is an annulus for which F, the fixed point curve of 7, is a generator of the 
fundamental group. We shall now prove the first half of Theorem I. 


PRroposiTION 1. The fundamental group of S* — C or of Cl(S* — N) is 
infinite cyclic. 


Proof. We form X U B,, which by construction is a torus; it is also the 
boundary of each of the polyhedra N VU CIA,2 and CI(A; — N). By Alexan- 
der’s theorem (1) one of these two is homeomorphic with a solid torus. We 
consider two cases. 

(a) Suppose that Cl(A, — N) is a solid torus. Then Cl(A2 — N) is alsoa 
solid torus, as 7-image of Cl(A, — N). The union of CIl(A; — N) and 
Cl(A2 — N) is Cl(S* — N); the intersection of Cl(A; — N) and Cl(A2 — N) 
is X. To compute x; Cl(S* — N), we use the addition theorem (7, p. 177), 
according to which x; Cl(S* — N) is the free product of r; Cl(A; — N) and 
x, Cl(A2 — N) with the additional relations obtained by equating elements 
which correspond to the same element of +,(X). Let g; and ge be the generators 
of the (infinite cyclic) groups #; Cl(A; — N) and x; Cl(A2 — N); F, considered 
as a curve in Cl(A; — N), represents some power g,” of g:. Since T(F) = F 
and 7(g:) = ge, there will be only one new relation, namely, g;" = g.". The 








°c ee 





~~ 


FIXED POINTS OF INVOLUTIONS IN S* 219 


homology group H,CI(S* — N) is then the abelian group with two generators 
yi and 2 and the relation my, = my:. Because of the Alexander duality 
theorem this group must be infinite cyclic; it follows that m = +1. But this 
clearly means that x; Cl(S* — N) is infinite cyclic, and that F represents a 
generator. 

(b) Suppose that CIA, \U N is a solid torus. We show first that the curve C 
represents a generator of r,(ClA:\V NV). For the proof we can consider ClA, 
instead of CLA; LU N, since it is a deformation retract of the latter. C represents 
some power #* of a generator 8 of r:ClA2; we wish toshow k = +1, and assume 
temporarily that this is not so. It follows, going to the homology, that C is 
homologous to 0 mod k& in CIA». Applying T, one gets the same behavior in 
CIA,. On the other hand, from the explicit structure of S = CIA; (\ ClA2 we 
see that C is not homologous to 0 mod & on S. But this leads to a contradiction 
with the Mayer-Vietoris theorem (3) for the decomposition S* = CIA, ClAg, 
since H,(S*) = 0. It follows that k = +1, as we claimed. 

We now represent CIA,\U N as union of the two polyhedra Cl(A, — N) 
and N, whose intersection is CIB, Let y denote the generator of 2,(N), 
represented by C. A generator of 2:(CIB;), e.g., BdP, represents then the 
element y? in 7,(V), and a certain element a in 7,;Cl(A, — N). The group 
m,(ClA2U N) is obtained, according to the addition theorem used above, by 
adding the new generator y to the generators of r;Cl(A2, — N) and adding the 
relation a = y? to the relations in 7,;Cl(A2 — N). Since y is represented by C, 
it follows that y is a generator of the group 7:(Cl4,\U N), which is infinite 
cyclic, since ClA;\ N is a solid torus. Consequently the element a of 
m,Cl(A» — N) is also of infinite order, and we see that 2;(ClA2\ N) is the 
free product of 2;Cl(A2: — N) and 2,(N) with amalgamated subgroups {*} 
and {a} in the sense of Schreier (5). It follows from this theory that r;Cl(A»— NV) 
is isomorphically contained in r,(ClA:\V N), and is therefore itself infinite 
cyclic. But then the argument of (a) can be used to prove Proposition 1. 

We now come to the second part of Theorem I: 


PROPOSITION 2. The linking number of F and C is +1. 


Proof. The reasoning of (a) and (b) above showed that the groups 
mCl(A,; —.N) and 2;Cl(S* — N) are infinite cyclic, with F representing a 
generator of #;Cl(S* — N) and so also of 4;(S* — C). Since obviously there are 
elements in 2;(S*? — C), whose linking number with C is +1, it follows that 
F must also have this property, and Theorem I is proved. 


8. The proof of Theorem II. Again 7 is an involution of S* which pre- 
serves orientation with fixed point curve F; now F is unknotted. This hypo- 
thesis means that F bounds a cell N where N is a polyhedron of an invariant 
subdivision of S*. Earlier arguments show: 

(1) that N may be modified (§3) so that N (\ TN consists of a finite set of 
simple closed curves; 
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(2) that by a further reduction (§5) it can be arranged that 
N(\TN = F. 


Then N U TN is a 2-sphere (polyhedral) and S* is the union of two closed 
3-cells, L and T(L), each having for boundary the 2-sphere N\U TN. Let 
T» be the rotation of S*, of period 2, given by x1; — —x1, x2 > —X2, X3 — Xz, 
x4 — x4. The fixed point curve Fp is given by x; = x2 = 0. An invariant sphere 
is given by x; = 0; it is divided by Fy into two 2-cells No and To(No), given 
by x1 = 0, x2 > 0, respectively x. < 0. The whole sphere S? is the union of the 
two 3-cells Ly and Ty(Lo), given by x2 > 0, respectively x. < 0, each having 
for boundary the sphere No \U To(No). 

We begin now by setting up a homeomorphism ¢ between N and Np, arbi- 
trarily chosen. This can be extended to a homeomorphism ¢ between NV \U T(N) 
and No To(No) by defining ¢ = T)-¢-T on T(N). The homeomorphism ¢ 
between NV \U T(N) and No \U To(No) can be extended to a homeomorphism ¢ 
between Z and Ly». This in turn can be extended to a homeomorphism of S* 
with itself by defining ¢ = T)-¢-T on T(L). The ¢ so constructed satisfies 
@ = Ty-¢-T, or T = ¢'-To-¢, which shows that our involution T is equi- 
valent to the rotation 7». 
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FULL INDIVIDUAL AND CLASS 
DIFFERENTIATION THEOREMS IN THEIR RELATIONS 
TO HALO AND VITALI PROPERTIES 


C. A. HAYES, Jr. and C. Y. PAUC 


Introduction. In his article (22), de Possel laid the foundations for an 
abstract theory of differentiation of set functions, the term “‘abstract’’ being 
meant in the sense of Fréchet-Nikodym, that is, without reference to a 
euclidean, metric, or topological background. In 1.1, we adopt, substantially, 
his notion of derivation basis. De Possel considered two Vitali properties for a 
derivation basis. The strong or classical Vitali property asserts the existence 
of an enumerable disjointed p.p. covering family; it implies the full differentia- 
tion theorem for integrals, that is, the existence almost everywhere of the 
derivative and its equality with a Radon-Nikodym integrand. The weak 
Vitali property asserts the existence of a p.p. covering family with arbitrarily 
small overlap; it is equivalent to the density property or the full differentiability 
of Lipschitzian integrals. One of us, in (10) and (11), introduced in Morse’s 
setting of (14), two variations of the weak Vitali property. In the pseudo- 
strength the overlap refers to any Radon measure; the 2-overlap property 
(p > 1) involves the pth power of the excess of covering function. The pseudo- 
strength implies the existence almost everywhere of the derivative of any 
Radon measure. The 2®-overlap property does the same for the integrals of 
¥@-functions (p~' + q-' = 1). In §1, these Vitali properties are transferred 
to a gencral derivation basis B. The missing topology is replaced by the 
pretopology (21) which is derived from B. Individual and class differentiation 
theorems are established for integrals and Radon measures. The technique at 
first follows de Possel’s trend. Section 2 deals with the following converse 
problem: Do full differentiation assertions for sigma-additive set functions 
imply covering properties of Vitali types? It suffices to refer to the proof of 
the Zygmund theorem in (29), and of the Zygmund-Marcinkiewitz-Jessen 
theorem in (25) concerning the interval basis, to realize that not all differentia- 
tion proofs rest on Vitali properties. By an adaptation of de Possel’s proof of 
his equivalence theorem, we show that the full differentiability of integrals 
or Radon measure is equivalent to a Vitali property (2.2). Here again we 
proceed from individual to class assumptions. We prove (2.4) that the full 
differentiability of integrals of 2“-functions (¢g > 1) implies the 2@”-overlap 
property for p’ < p, when B is a D-basis. Thus the interval basis possesses 
the 2®-overlap property for any (finite) » > 1. In the classical proof by 
Carathéodory of the Lebesgue differentiation theorem for the cube basis, 
the preliminary Vitali theorem is deduced from a halo property of cubes, 
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namely: If for any cube Vo (mucleus), H(V>) (halo) denotes the union of those 
cubes V which are not greater than V, and intersect Vo, then the dilatation, 
that is, the ratio of the measure of the halo to the measure of Vo, is uniformly 
bounded for all Vo, in fact equals 3", where m denotes the dimension of the 
euclidean space. Halo properties differ by the requirements for the B-sets 
constituting the halo, mainly by the incidence requirements; in the example 
just given, the non-vacuity of V-V» was demanded. In (1), Busemann and 
Feller gave, for the special euclidean bases considered by them, a weak halo 
property equivalent to the density property. In 2.5, we give an individual 
differentiability criterion of Busemann-Feller type, thus shedding light on a 
second converse problem: Do full differentiation properties for o-additive 
set functions imply halo properties? In this connection we mention that a halo 
property of Busemann-Feller type creeps into the proof of 2.4. Morse, in his 
fundamental memoir (14), formulated halo conditions, securing the strong 
Vitali property for his blankets. He assumes that the B-sets are closed; but he 
also shows that his differentiation theorems remain valid when this assumption 
is dropped. In §3 we prove that in our setting, the pointwise halo condition 
implies the Vitali property for integrals or Radon measures (3.2). We give 
two examples (3.3 and 3.4), where the surrender of the closeness of the B-sets 
leads to the substitution in the assertions of the new Vitali property (pseudo- 
strength) in place of the strong Vitali property. In §4 we tackle the differentia- 
tion of functions \ defined on the B-sets. Our main tool is the Vitali integration 
to transform the “‘interval’’ function \ into a set function which is expected 
to turn out to be an integral y on the measurable sets. When this proves true, 
then the differentiability study of \ reduces to that of ¥ and our methods of 
§1 are applicable. Our results contain as a special case those published by 
Morse in (14) on the differentiation of addivelous functions. The authors wish 
to acknowledge with thanks helpful suggestions made by K. O. Househam 
in the course of many discussions. 


§1. DIFFERENTIATION OF o-ADDITIVE SET FUNCTIONS UNDER COVERING 
ASSUMPTIONS OF VITALI TYPE 


1.1. Setting. R denotes a set of points, which is our universe. S denotes the 
Boolean o-algebra! of all subsets of R. 

For two sets X and Y belonging to S, X D Y means ordinary inclusion, 
permitting the equality X = FY. 

We use both the lattice-theoretical symbols LU, (, U, fi, and the algebraic 
symbols +, —, and ., in Stone’s sense. However, we generally use the latter 
only when Stone’s and Hausdorff’s (set-theoretical) meaning coincide. 

M denotes a Boolean o-algebra of subsets of R with R as unit; wu represents a 
fixed o-finite measure defined on M;; ,* is the completion in S of yu, defined on 


M*. Also, @ represents the outer measure derived from yz (or, equivalently, 


1Definition of Boolean e-algebras and other related terms may be found in (4, pp. 19-26). 
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from u*), defined on S, namely a (S) = inf w (M), where the infimum is taken 
over all sets M such that SC M and M € M. Similarly, we define 2 on S 
by a(S) = sup u(M), where the supremum is taken over all sets M such that 
S> M and M € M. 

N denotes the family of the u-nullsets, which is a c-ideal in M (regarded as a 
Boolean o-ring); N* is the family of the u*-nullsets, which is a a-ideal in S 
(envisaged as a Boolean o-ring). 

By X D Y (mod N) we shall mean Y — X-Y € N; X = Y (mod N) will 
be understood to mean that Stone’s difference X — Y = [(X — X-Y) + 
(Y — X-Y)] EN. 

For S € S, a u-cover § of S is any M-set for which § D S, and g(S-M) = 
u(S-M) for any M € M. Similarly (3, p. 68), a u-kernel S of S is any M-set 
such that S C S, and f(S-M) = »w(S-M) for any M € M. 

Two sets S’ and S” are said to be u*-entangled if they have positive outer 
measure and common yu-cover. 

We define a derivation basis B as follows. We assume that to each point x 
of a fixed subset E of R, there correspond sequences, in the sense of Moore- 
Smith, of M-sets of finite positive measure, called constituents, which are said 
to converge to x, and are denoted generically by M,(x). Further, we assume 
de Possel’s heredity (or Fréchet’s convergence) axiom*; namely, every (co- 
final) subsequence of an x-converging sequence itself converges to x. The 
family of the sequences M,(x) is our derivation basis B. The elements of B 
are thus converging sequences, together with corresponding convergence 
points. (This notion involves a basic measure yw. The correspondence of 
converging sequences to points is called prebasis by Haupt and Pauc in (9). 
A prebasis defines a pretopology (21, §2). Some pretopological notions involve 
a a-ideal of nullsets). The definition just given does not exclude the possibility 
that two distinct points correspond to the same converging sequence. We 
denote by D the family of sets occurring in the sequences M,(x) for all 
x € E. If \ isa numerical function defined on the D-sets, and x € E, then we 
define 

A(M,(x)) 


e , 
D (x) = sup] tim sup ALC) | , 


where the expression in brackets denotes the limit superior for any one 
x-converging sequence M(x), and the supremum is taken among all sequences 
converging to x. In exactly similar fashion we define 


— ACA (2)) | 

Dz (x) = in| tim inf u(M.(x)) 3° 

We call D*\(x) and Dsd(x) the upper and lower B-derivates at x, respectively. 
If D*\(x) = Dsd(x) (finite or infinite), we say that the B-derivative D\(x) = 





*This is introduced in (22, p. 397). The limitation to ordinary sequences i = 1, 2,... is 
irrelevant throughout de Possel’s paper. 
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D*y(x) = Ded(x) exists at x, or that d is B-differentiable at x. If the sequences 
M(x) are subsequences of one universal sequence (14), then we can drop 
the prefixes “sup” and “‘inf’’ in the expressions for D*\(x) and Dsd(x). 

By subbasis of B, we mean any subfamily B* of B, containing all subse- 
quences of any of its sequences, retaining the corresponding convergence 
points. The family of the constituents occurring in the B*-sequences is called 
the spread of B*; the set of points D(B*), each of which is a convergence point 
of at least one B*-sequence, is called the domain of B*. The spread V = V(X) 
of any subbasis B* with D(B*) > X (mod N*) is called a B-fine covering of 
X. A B-fine covering V = V(X) of a set X may also be defined as a family 
of constituents containing, for almost every x (that is, everywhere but on an 
N*-set) in X, the sets of at least one sequence M, (x). 

The importance of the latter notion for the theory of differentiation results 
from the following considerations. If X C [D*\ > a], then the family of those 
constituents M satisfying \4(M) > au(M) is a B-fine covering of X. The 
same is true if ““D*” is replaced by “Ds” and ““>"’ by ‘“‘<”’ in the preceding 
sentence. 

In the second definition of a B-fine covering, the requirement of the existence 
of at least one sequence M,(x) may be replaced by the following stronger one: 
Every x-converging sequence admits of a subsequence, the sets to which 
belong to V. When this condition holds, we shall say (21, p. 74) that V is a 
full B-fine covering of X. This new requirement is equivalent to the apparently 
stronger one: For every x-converging sequence S consisting of the sets M, 
there exists an index «’ = «’ (S) such that « > «’ implies M, € V. The inter- 
section of two full B-fine coverings of X is again a full B-fine covering of X; 
the intersection of a B-fine covering of X and a full B-fine covering of X is a 
B-fine covering of X. 

With the same notation as above, the family of those constituents M 
satisfying \(M) > au(M) is a full B-fine covering of any set X C [Dsd > a]. 
The same is true if “Ds” and “>” are replaced by ‘‘D*” and “‘ <”’, respectively, 
in the foregoing sentence. 

A point x is termed totally interior (with respect to B) to a subset X of R, 
if, for every x-converging sequence S consisting of the sets M,, there exists 
(18) an index «’ = «’ (S,x), such that « > «’ implies M, C X. We represent 
by I(X) the set of points x which are totally interior to X. J(X) need not be a 
subset of X. (In the case of a blanket F, J(X) is FO X in Morse’s notation 
(14, p. 217).) If G is such a subset of R that E-G C I(G), (mod N*), then G 
is called an external D-open set (with respect to B and N*). We use G as a 
generic name; G will denote their family. D refers to Denjoy, who introduc- 
ed the internal D-open sets under the name ensembles-enveloppes, for his special 
bases, and used them as approximation sets (2; 21, p. 84). 

From the above follows the G-pruning principle: If V is a B-fine covering 
of X, and if the external D-open set G includes X (mod N*), then the family 
V,, of the V-constituents in G is still a B-fine covering of X. 
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Remarks. In Morse’s differentiation theory (14), the space R is a metric 
space, with, however, the slight relaxation that two distinct points may have 
zero distance apart. R is provided with a Carathéodory outer measure ¢, 
finite on bounded sets. To each point of a subset A of R there corresponds a 
family F(x) of sets such that every (spherical) neighborhood of x includes 
an F(x)-set. The function F is called a blanket. (Blankets are special cases of 
prebases (9).) In all blankets studied by Morse and Hayes (10; 11; 12; 13; 
14), the sets occurring in the families F(x) are Borelian. In order to subsume 
Morse’s blankets under the general derivation bases, we must reduce the do- 
main A of definition of F to the set E of points x without 0-sequences, that is, 
sequences M,, M2,..., M,,... with M, € F (x), ¢(M, =0 (i = 1,2,...), 
which converge metrically to x; under Morse’s assumptions, the set A-E of 
points x with 0-sequences is a ¢-nullset (14, p. 218). Then we correlate to each 
M in F(x) the index p = p(M, x) = diameter (M U {x}), and define the 
x-converging sequences as subsequences of the universal sequence M, (x). The 
restriction of @ to the Borelian sets is taken as our fundamental measure uy. 


1.2. Comparison lemmas. For S C R, we denote by S-M the family of sets 
S:M, where M € M, and by us the restriction of 7 to S-M; thus, for M € M, 


us(S-M) = a(S-M) = u(S8-M). 


For X CS, we have ws(X) = a(X). A real-valued function h defined on S 
is said to be ws-measurable if the Lebesgue sets [kh < a] (— ~ <a < =) 
belong to S-M. 


LEMMA 1.21 We suppose that: 

(Al) f and g are real-valued functions defined on P and Q, respectively, 
whereQCPCR. 

(A’2) Whenever A and B are yu*-entangled sets of finite outer measure for 
which A U B C Q, then there exist no two numbers a and 8 such that a < 8, 
ACIf <a],andB C [g > 6]. 

Then f > g (mod N*) on Q, that is, Q-[f < g] © N*. 


Proof. We assume the assertion to be false; thus a(Q-[f < g]) > 0. There 
exist two (rational) numbers a and 8 such that a(Q-[f < a < 8 < g]) > 0. 
We take for A and B two equal subsets of positive finite outer measure of 
[f<a<8<g]-Q. Clearly A CI[f <a]-Q, BC [8 < g]-O0, A = B. Since 
(A) = @(B) > 0, we have a contradiction with (A’2). 


LEMMA 1.22. We assume that (A1) holds and in addition: 

(A”’2) Whenever A and B are any two u*-entangled sets of finite outer 
measure for which A U B C Q, then there exist no two numbers a and 8 
such that a < 8, A C [f > 6], and BC [g < a]. 

Then f < g (mod N*) on Q, that is, Q-[f > g] € N*. 


Proof. Replace f and g in Lemma 1.21 by —f and —g, respectively. 
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LEMMA 1.23. We assume (A1) holds and also: 

(A2) There exist no two yu*-entangled sets A and B of finite outer measure 
with A \U B C Q such that the convex closure of f(A) and g(B) have positive 
distance apart.* 

Then f = g (mod N*) on Q, that is, Q-[f # g] € N*; also the restriction 
f\Q of f to Q, and g, are both u* o-measurable. 


Proof. It is readily seen that (A’2) and (A’’2) together are equivalent to 
(A2); application of Lemmas 1.21 and 1.22 completes the proof of the first 


part. 
We attend to the second part. Since yu is o-finite, R =  R,, where R, € M 
and u(R,) < © for n=1,2,.... Hence Q= UQ,, where Q, = Q-R,. 


Since the u* 9,-measurability of g|Q, (restriction of g to Q,), for all m, implies 
the u* g-measurability of gj|Q = g, we can limit ourselves to the case where 
2(Q) is finite. We assume that the yu*-measurability of g does not neta; 
hence, there exists a —— number 6 such that D = le < 4] is not p*¢- 
measurable. We denote and Da u* ‘g-cover and a u* o-kernel of D, re- 
spectively. We let D’ = D — D, D” = D — D. The u*¢-non-measurability 
of D implies that ron = f(D’) and pu*o(D”) = a(D”) are both positive. 
Thus, for a suitable 8 > 6, the set S = [g > 8]-D”’ is of positive outer measure. 
The difference 
D?=D-D=D'+D" €Q-M; 


hence there exists a u* g-cover S of S which is included in D®, so that 8 = §-D’ + 
§-D’”. Since f = g (mod N*) in Q, then D = [f < 4]-O (mod N*); defining 
A =8-D’-|f < 6], then A = 8-D’ (mod WN”). Due to the definition of D”, 
ii* o(D”) = 0, thus 8-D” contains no u*g-measurable set of positive u* ¢- 
measure. Since § = A + §-D” (mod N*) and A C8, it follows that S is a 
u*o-cover for A. Let B = S. Then A and B are yu*g-entangled, hence p*- 
entangled. If a denotes a (rational) number between 6 and 8, we have A C 
lf < a], B C [g > 86], contradicting (A’2), implied by (A2). 


CoroLuary 1.24. If P = Q = R (mod N*), (Al) and (A2) imply f = g 
(mod N*) and the u*-measurability of f and g. 


Remarks. Lemmas 1.21 and 1.22 will be used when f is a Radon-Nikodym 
u*-integrand and g a derivate. They are analogous to de Possel’s lemma 
(22, p. 394). Lemma 1.23 can be used when f and g are the (extreme) derivates. 
If we know somehow that both f and g are u*-measurable, we can formulate 
(A’2) and (A’2) considering only yu*-measurable sets A and B. The uz*- 
entanglement condition then means A = B (mod N*) and u*(A) = uw*(B) > 0 


*This formulation, which may seem unnecessarily sophisticated for numerical functions, is 
intended for the more general case where f and g take their values in a separable Banach 
space. 
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1.3. The individual Vitali assumption. 


' PRELIMINARY DEFINITIONS 1.31.By M-function we shall mean a real-valued 
function defined on M; by M-measure, a non-negative o-additive M-function; 
by signed M-measure, a o-additive M-function of variable sign. 

u-finiteness means finiteness on the M-sets of finite measure. Hence, a u- 
finite y-integral is a p-integral ¥(M) = S uf (x) dy, finite on the M-sets of finite 


measure. 
We say that the property (G,) holds if and only if R is the union of enumer- 
; ably many G-sets G®, such that @(G°,) < ©,” = 1,2,.... 


If such a sequence G°, exists, then a set X is said to be bounded if it is 
included in one of the sets G°,. Thus, our notion of boundedness depends 
upon the special sequence of G-sets occurring in the formulation of (G,). 

When (G,) holds, we adopt the following definitions. A Radon y-integral 

is any (indefinite) yw-integral ¥(M) = S uf (x) du, bounded in the sets G°,; 
that is, there exists, for m = 1,2,..., a number 8(m) such that if M € M 
and M C G°,, then |y(M)! < 8(m). A Radon measure is an M-measure bounded 
in the sets G®°,; a signed Radon measure is a o-additive M-function bounded 
in the sets G°,. A o-bounded function is any real-valued function defined on R 
and bounded on each set G°,,. 
We state some useful classical decomposition theorems. Any y-finite signed 
M-measure is the sum of a u-finite integral and a finite singular part. Any 
signed Radon measure is the sum of a Radon yz-integral and a singular part. 
Also, any signed Radon measure y is the difference of two Radon measures 
y* and y~; the sum r = y¥* + qo is the fotal variation of y. If (G,) is not as- 
sumed, ‘“‘Radon”’ can be replaced by “‘u-finite’’. 

Henceforth, when any concept involving boundedness is considered, it 
will be tacitly understood that (G,) is presupposed. 








, Remarks. In the formulation of Lemmas 1.21, 1.22, and 1.23, the phrase 

“of finite outer measure”’ may be replaced by ‘‘bounded,”’ when (G,) holds. 

In the subsequent sections we state “full differentiation theorems’’ for 

functions y of the type just described, namely, theorems asserting the exis- 
) tence almost everywhere (that is, mod N*) on E of the B-derivative Dy and 
its coincidence on E with a Radon-Nikodym uy*-integrand. We avoid the use 

of such terms as “‘R-N derivative” (4, p. 133) and “‘pseudo-dérivée” (22, 


p. 396), reserving “derivate” and “‘derivative’’ for functions defined by means 
of a convergence process, either pointwise, as usual, or globally, as in (2) 
, under “L-dérivée.”” In the (G,) case such an assertion will be proved if we 


: establish it for any G®, as universe and the G°,-pruned basis as derivation 
basis. The sets G°, play the part of autonomous domains of differentiation. 
Thus, assuming (G,), we reduce the case of a finite basic measure yu, in which 
the Radon assumption on y implies y-finiteness. 

We do not assume the sets G°, to be u- or u*-measurable, so the u-covers of 
subsets of G°,, in particular, of E-G°,, need not be included in G®°, (mod N*) 
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(see Proposition 1.48). The constituents of the G°,-pruned basis, being s- 
measurable, are included (mod N*) in any measure kernel of G°,. 

Actually, for our purposes, a weaker form of (G,) suffices, as follows: There 
exist enumerably many sets R®, of finite outer measure such that R = 
U I(R®,) (mod N*). This property is weaker, since J(R°,) need not be a G- 
set. A set X is then said to be bounded if for some n, X C I(R°®,). Similarly, 
R°,-pruning of a B-fine covering V of a set X means discarding all V-sets not 
included in R°,. The remaining V-sets form a B-fine covering of X-I(R°,). 


DEFINITIONS 1.32. By M-family we mean an enumerable family of sets, 
each with an associated multiplicity (27, p. 277). Equivalently, an M-family 
may be defined by any sequence of sets, the multiplicity associated with a set 
coinciding with its number of appearances in the sequence. In the latter 
formulation, abstraction is made of the order of appearance of any set. Certain 
advantages arise from the use of M-families instead of ordinary families in 
the work to follow. For instance, the frequency (defined a few lines farther on) 
is additive: thus, if E and F are M-families and G is the M-family obtained 
by uniting them, then ¢g + ¢r = ¢g. However, it is only subadditive for 
ordinary families. Also, any u-measurable function on R, taking only positive 
integral values, may be regarded as the frequency function of a measurable 
M-family covering R. Awkward limitations occur if we restrict ourselves to 
families without repetition. In natural fashion, we may define the limit of a 
sequence E,, E:,..., E,,... of M-families as the M-family E, if it exists, 
such that lim ¢g, = $e. So defined, E has an overlap (defined just below) 
which is conveniently represented by use of the Lebesgue convergence 
theorem. 

If Eis an M-family, then cE will denote the union of the sets occurring in E. 
By E-frequency $(x) at the point x we shall mean the number of E-sets 
(possibly ~) to which x belongs; by E-excess function we shall mean that 
function ¢, defined on cE by eg(x) = ¢x(x) — 1. We define 0E = [ex(x) > 0] 
= [@z(x) > 1], and call 6E the E-overlap set. 

Henceforth we assume that the E-sets belong to M. Then ¢¢g and eg are 
u-measurable. If y is any M-measure, we define the ~-overlap of E by 


ov) = fi cx(x) av. 


In case ¥(cE) is finite, we note that 


w(E, v) = px uee¥(M) — ¥(cE). 


In the particular case y = yu, the foregoing equations define the y-overlap of 
E, which is of somewhat special importance (8, p. 193). 

If X C R, Misa w-cover for X, and ¥ is any M-measure, then the y-overflow 
of E with respect to X and M is defined as ¥(cE — M-cE). If ¥ is u-absolutely 
continvous, then the quantity just defined will not depend upon the particular 
p-cover M, but will be the same for every set X, and the terminal phrase 








as 
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“and M” may be dropped. In particular, if y = wu, then u(oE — X-cE) is 
the u-overflow of E with respect to X. 

If X C R, then we define the y-defect of covering of X as the number 
p(X — X-cE), and we denote this by the notation y(E, X, uz). E is said to 
be an e-covering in measure of X if y(E, X, u) < ¢; it is said to be an 0- 
covering in measure of X if X C cE (mod N*). 


DEFINITIONS 1.33. If y denotes a non-negative M-measure, we say that 
the basis B possesses the Vitali ¥-property if, and only if, for any X C E of 
finite outer measure, any B-fine covering V of X, any u-cover M of X, and any 
«> 0, there exists an (enumerable) M-family E of V-sets such that, for 
S = cE: 

(V1) X — X-S € N* (E is an 0-covering of X); 

(V2) ¥(S — S-M) < « (the y-overflow of E with respect to X and M is 
less than e); 

(V3) w(E, ¥) < « (the y-overlap of E is less than e). 

(K. O. Househam has suggested the term y-redundancy of E with respect to 
X and M for the sum of the y-overflow and the y-overlap.) 
In case only (V1) and (V3) hold, we say that B possesses the reduced Vitali 


y-property. 


Remarks. lf B possesses the Vitali property corresponding to y, then it 
evidently possesses the Vitali property corresponding to all ¥ < y; that is, 
the Vitali y-property has a hereditary character. In particular, if y is a Radon 
or a u-finite M-measure, B possesses the Vitali property corresponding to the 
u-absolutely continuous part of y. 

Some equivalent formulations of the Vitali ¥-property are possible. The 
requirement (V1) may be replaced by an e-covering condition; simultaneously, 
““enumerable”’ may be replaced by “finite.” That such an e-covering version 
implies the original version can be shown by an exhaustion process. The 
requirement that X be of finite outer measure may be dropped. In the (G,) 
case, the phrase ‘‘of finite outer measure’’ may be replaced by ‘“‘bounded’’. 


DEFINITION 1.34. We define the upper u-approximation property of the M-sets 
by the G-sets (abbreviated (UG)) as follows: Corresponding to any M-set of 
M of finite measure, and any > 0, there exists (21, p. 83) a G-set G for which 
M C Gand a(G — M) < 7». 

We note that (UG) implies (G,). (UG) is not altered if the condition “‘of 
finite measure’’ is waived. 


PROPOSITION 1.35. If (UG) holds, y is a non-negative y-finite (resp., Radon) 
p-integral, and B possesses the reduced Vitali -property, then B enjoys the 
Vitali y-property. 


Proof. We let X denote any subset of E of finite outer measure (resp., 
bounded), V any B-fine covering of X, « any positive number. We use (UG) 
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to find a G-set G’ > X with a(G’ — X) < 1. Since y is w-absolutely contin- 
uous and W(X) is finite, there exists 7 = 7(X,¥, €) > O such that |y(X) — ¥(M)| 
<« whenever M € M, MCG’ and u(M — X) <n, where M — X denotes 
Stone’s difference. Invoking (UG), and the fact that the product of two 
D-open sets is again D-open, we find a G-set G with G’ D G D X and a(G—X) 
<1. We apply the reduced Vitali ¥-property to the G-pruned family Vg, to 
obtain an M-family E satisfying (V1) and (V3). Since the E-sets lie in G, 
we have u(S — S-X) < a(G — G-X) < ». Thus ¥(S) < ¥(X) + «, whence 
¥(S — S-X) < «€; (V2) holds, as required. 


DEFINITION 1.36. We say that Haupt’s adapiation property holds if and only 
if there exists a od-family G° of G-sets which is a Borel generator for M 
(that is, M is the smallest o6-family including G*°) (5, p. 173). 


PROPOSITION 1.37. Haupt's adaptation property implies the following 
(which includes (UG)): For any Radon measure ¥, any M-set M, and any 
e > 0, there exists a G-set G such that G > M and $(G — M) < «. 


Proof. For the case ¥(R) < @, the proof is given in (7, p. 27). We shall 
establish the theorem assuming ¥(R) = ©. We introduce the sequence 


G°,, G°s,...,G,..., associated with the property (G,), and for any set 
M € M and any positive integer n, we define ¥,(M) = $(G°,:M). 
We let €:, €2,..., €:,... denote a sequence of positive numbers whose sum 


is less than «. If M € M, then we may apply the theorem to y,, since y,(R) < ©, 
to find a G*-set G’, such that M C G’, and 

¥,(G’, — M) = ¥(G°,: (Gi, — M)) < &. 
The set G°,-G’, = G”, is thus a G-set (not necessarily a G°-set) including 
G°,-M, such that 7(G”,, — G°,-M) < «€,. We let S denote the union of the sets 
G”,.; then SD M, and 

S— M = Uc”, — Uc’.- Mc U(G"”, — G@,- M), 
hence 

v(S — M) < > W(G", — G,- M) <«. 


Since S is a G-set, the proposition is proved. 


Remarks. The property described in Proposition 1.37 is called the universal 
upper approximation property for G°-sets. It holds (16, pp. 244-245) in the 
special case where R is a metric space, y is a classical finite Radon measure, 
and G° is the family of the open sets. 


PROPOSITION 1.38. If p is a u-finite M-measure, Haupt's adaptation property 
and the reduced Vitali p-property both hold, then the Vitali ~-property holds. 


Proof. This follows closely the proof of Proposition 1.35, except that we 
take a w-cover M of X, and use Proposition 1.37 directly to find a G-set 
G>M with 9(G — M) < «. As before, we find an M-family E satisfying 
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(V1) and (V3), with members lying in G. Thus $(S — S-M) < ¥(G — M) <.«, 
and (V2) holds. 


1.4. The individual full differentiation theorem for Radon or ,-finite 
u-integrals. 

PROPOSITION 1.41. Jf p ts a non-negative Radon (or y-finite) w-integral 
f(x) du, and B possesses the Vitali u-property, then Dep > f(mod N*) on E. 


Proof. According to the Remarks following Definition 1.31, we need treat 
only the case where y is yu-finite. We shall obtain a contradiction from the 
assumed existence of two yz*-entangled subsets A and B of E of finite outer 
measure and two numbers a, 8 such that a < 8, A C [f > 8], BC [g < a], 
where g = Dey. Since [f > 8] € M, A’ = A-[f > 6] is a u-cover of A; since 
u(A) > 0, we have 
1.411 ¥(A’) > Bul(A’). 


On the other hand, the family V of the constituents V satisfying 
1.412 V(V) < ap(V) 


is a B-fine covering of B C [g < a]. Thus, by virtue of the Vitali u-property, 
for any natural number n, there exists an M-family E, of V-sets V,,, such that 


if S, = cE,, then 

1413 B-—B-S, € N’: u(S, — S,- B) < 2"; u —overlap of E, is less than 2™. 
Using 1.413 and 1.412 we obtain 

L414 ¥(B) < WSs) < DL W(Vai) < aD w(Vas) < (u(Ss) + 2), 


and lim u(S,) = »(B). Combining, we obtain 
1.415 ¥(B) < au(B), 


which, since ¥(A’) = ¥(B) and a < 8, is a contradiction of 1.411. From 
Lemma 1.22 follows the assertion f < g (mod N*). 


Remarks. If an e-covering version of the Vitali u-property is used in place 
of the 0-covering version, then the first statement in 1.413 is replaced by 
u(B — B-S,) < m, and because of the y-absolute continuity of ¥, 7, can be so 
chosen that ¥(B) < ¥(S,) + 2; 1.414 has to be altered accordingly. 

An example of a blanket possessing the Vitali u-property, and a function 
g € 2® for every p > 1, such that its integral y has Dy = © everywhere, is 
known (11, p. 293). 

PROPOSITION 1.42. Jf p is a non-negative Radon (or y-finite) u-integral 
Sf (x) du, and B possesses the Vitali ~-property, then D*y < f(mod N*) on E. 

Proof. As in the preceding proposition, we may and do assume that y 


is u-finite. We assume that A and B are two u*-entangled sets of finite outer 
measure, a and 8 two numbers such that a < 8, A C [f <a], BC [g > 8], 
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where ¢= D*y. Since [f <a] € M, A’ = A-[f <a] is a yw-cover for A. 
Since (A) > 0, we obtain 


1.421 ¥(A’) < ap(A’). 
The family V of the constituents V satisfying 
1.422 ¥(V) > Bu(V) 


is a B-fine covering of B C [g > 8]. We use the Vitali y-property to determine 
for each natural number 2 an M-family E, of V-sets V,,; such that if S, = cE,, 
then 

1.423 B— B-S,€ N’; ¥(S, — S,- B) < 2; ¥ —overlap of E, is less than 2™. 


The y-overlap condition yields 
¥(S,) > > ¥(Vas) — 2; 
hence, using 1.422, 
¥(S,) > BD u(Vas) — 2* > Bu(S,) — 2%. 


This last and 1.423 together yield, for m = 1, 2 


my eens 


1.424 ¥(B) + 2~ > Bu(B) — 2”; 


hence ¥(B) > Bu(B), which contradicts 1.421, since a < 8 and ¥(A’) = w(B). 
Thus, Lemma 1.21 applies and D*y < f (mod N*) on E. 


Remark. With the e-covering version of the Vitali ¥-property, we replace 
the first statement in 1.423 by u(B — B-S,) < 2, hence u(S,) > u(B) — 2, 
and in 1.424, we replace p(B) by »(B) — 2. 


THEOREM 1.43. If y is a non-negative Radon (or y-finite) u-integral and B 
possesses the Vitali u-property and the Vitali ~-property, then the B-derivative 
Dy exists almost everywhere on E and is equal, mod N*, to f|E, where f denotes 
any Radon-Nikodym yu*-integrand of y. 


Proof. This is an immediate consequence of Propositions 1.41 and 1.42. 


DEFINITION 1.44. An M-function y is said to be majorized or dominated 
by the M-function y° if |¥()| < ¥°(M) for every M € M. 

We note that a signed Radon measure (resp., u-finite M-measure) dominated 
by a w-integral is itself a u-integral; also a finitely additive M-function domina- 
ted by a Radon measure (resp., u-finite M-measure) is a signed Radon measure 
(resp., signed y-finite M-measure). 


THEOREM 1.45. If p° is a non-negative Radon (or y-finite) u-integral and B 
possesses the Vitali y-property and the Vitali ~°-property, then for any signed 
Radon measure (or p-finite signed M-measure) y dominated by y°; the B-derivative 
Dy exists p.p. on E and is equal to the E-restriction of a Radon-Nikod ym integrand 
of ¥. 
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Proof. This follows immediately upon decomposing y into y* and yy 
and using the hereditary character of the Vitali y-property. 


THEOREM 1.46. If ¥° is a non-negative Radon y-integral, B possesses the 
Vitali u-property and the Vitali ¥°-property, ¥ is a signed Radon measure, and 
there corresponds to each G°,, a positive finite number x(n) such that |\~(M)| < x(n): 
¥°(M) for any M-set M C G*, (¥°-Lipschitz condition), then Dy exists almost 
everywhere on E and is equal to the E-restriction of a Radon-Nikodym integrand 
of ¥. 

Proof. Apply Theorem 1.45 to each G°, used as an autonomous domain of 
B-differentiation, with «(n)-~° as majorant. 


Remark. If we know that the extreme derivates are u*-measurable, then 
from the Remarks under Corollary 1.24, it follows that Theorems 1.43, 1.45, 
and 1.46 remain valid, if, in the definition of the Vitali property, X is taken 


from M. 


DEFINITION 1.47. The special case of the Vitali property, wherein ¥ = yz, 
is the so-called weak Vitali property, and a basis B possessing it is called a 
weak derivation basis. 


Remarks. By Theorem 1.46, such a basis differentiates (in de Possel’s 
sense) the uniformly y-Lipschitzian integrals; explicitly, if ¥ is a e-additive 
M-function for which |y(M)| < « u(M) for M € M, where « is a constant, 
then Dy exists almost everywhere on E and is equal to the E-restriction of a 
Radon-Nikodym integrand of y. In de Possel’s version, a weak derivation 
basis differentiates the integral of any essentially bounded yu-measurable 
function, and in the Radon case, the integrals of functions which are measur- 
able and esseiitially bounded on each G°,. Five equivalent properties defining 
these bases in the case E = R (mod N*) are given in (22, pp. 403-405). 


PROPOSITION 1.48. Jf each B-fine covering of any subset X of E admits an 
enumerable subfamily covering X (mod N*), then for any G-set G, we have 
E-G C G (mod N*) or equivalently E-G C G (mod N*). If in addition, E = R, 
then the G-sets are u*-measurable. 


Proof. The family W of the B-constituents included in G is a B-fine covering 
of E-G; thus there exists an M-family EC W with E-G C cE (mod N*). 
Hence E-G C cE (mod N), and EG C G (mod N*). If E = R, then GCG 
(mod N*), hence G = G (mod N*). 





1.5. The individual full differentiation theorem for Radon measures. 


Lema 1.51. Jf ~ is a Radon measure (or a y-finite M-measure), B possesses 
the Vitali p-property, Q C E, u(Q) < ~,0 <9 < @, and there exists a B-fine 
covering V of Q such that for all V-sets V, 


1.511 ¥(V) > nw(V) 
then ¥(M) > 1 a(Q) for any M € M with QC M. 
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Proof. The Remarks following Definitions 1.31 permit us to consider only 
the case of a u-finite M-measure. We may also assume u(M) < @. We let « 
denote an arbitrary positive number, let T denote a yu-cover of Q for which 
Q CTC M, and invoke the Vitali ¥-property to obtain an M-family E of 
sets V,,4 = 1,2,..., for which, putting cE = S, we have 


1.512 Q—@Q-S € N*; y-overlap of E is less than e; ¥(S — S-T) < «. 
From 1.511 and the first two conditions of 1.512 we obtain 

W(S)> DWV) —e>a- La(Vi) — «> a(S) — € > ma(Q) - «; 
this result, combined with the last inequality of 1.512, yields 


¥(M) > ¥(T) > ¥(T-S) > ¥(S) — € > n a(Q) — 2e, 


which, since ¢ is arbitrary, gives the desired relation. 


THEOREM 1.52. If y is a Radon measure (or a u-finite M-measure) and B 
possess the Vitali y- and -properties, then ~ possesses almost everywhere in E 
a B-derivative Dy which is equal to a Radon-Nikodym integrand of y. 


Proof. We decompose y into the yw-regular part y, and the yw-singular part 
¥,, denoting by Ny, an N-set on which y, is concentrated, that is, ¥,(R— No) =0. 
B possesses the Vitali u- and y-properties, hence, in accordance with the 
Remarks under Definitions 1.33, B also has the Vitali u- and y,-properties. 
Due to Theorem 1.43, we need prove only that D*y, = 0 (mod N*). 

We let A, = [D*y, > n—"]-(R — No). The family of B-constituents V for 
which ¥,(V) > n—' u(V) is a B-fine covering of A,. In accordance with Lemma 
1.51, ¥,(M) > mn“ @(A,) for any M € M with A, C M; in particular, this 
holds for M = R — No, thus 0 > n—' g@(A,), hence 2@(A,) = 0, and A, is an 
N*-set. Now 


[D"y, > 0]: (R — No) = U[D"y, > w]- (R — No) = UA,, 
which is therefore also an N*-set. Finally, 
[D*y, > 0] = [D*y, > 0]-No + [D*y, > 0]-(R — No) 
is an N*-set, and the proof is complete. 


The following results are immediate consequences of Theorems 1.52, 
1.45, and 1.46. 


THEOREM 1.53. If ~° is a Radon measure (resp., u-finite M-measure) and B 
possesses the Vitali u- and ¥°-properties, then B differentiates any signed Radon 
(resp., u-finite) M-measure dominated by °. 


THEOREM 1.54. If ~° is a Radon measure, B possesses the Vitali u- and y°- 
properties, and y is such a signed Radon measure that there corresponds to any 
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G°, @ positive (finite) number x(n) such that |\~(M)| < «(n)-¥°(M) for any 
M-set M C G°,, then B differentiates y. 


The remark following Theorem 1.46 applies here also. 


1.6. Class differentiation theorems. 


DEFINITION 1.61. If B possesses the Vitali y-property for every non- 
negative Radon (resp., u-finite) w-integral y, then we say that B has the 
Vitali property for non-negative Radon (resp., u-finite) u-integrals. 


THEOREM 1.62. Jf B has the Vitali property for non-negative Radon (resp., 
u-finite) u-integrals, then B differentiates every Radon (resp., u-finite) u-integral. 


Proof. This follows from Theorem 1.43. 


DEFINITION 1.63. If B possesses the Vitali y-property for every Radon 
(resp., u-finite) M-measure y, then we say that B has the Vitali property for 
Radon (resp., u-finite) M-measures. 


THEOREM 1.64. Jf B has the Vitali property for Radon (resp., u-finite) 
M-measures, then B differentiates every Radon (resp., u-finite) M-measure.* 


Proof. This follows from Theorem 1.52. 


Remark. De Possel (23; 24) defines a “‘systéme dérivant généralisé”’ as a 
correspondence to each point x of a filter F, of non-negative u-measurable 
summable real functions f, vanishing outside a measurable set of finite measure 
(depending on f), and with Sefdu > 0, ¥ denotes any function on M into a 
Banach space, enumerably additive and of bounded variation. The derivative 


Dy(x) is defined as 
ims( f sav / f san). 


Conditions are stated for F, to differentiate Lipschitz, u-absolutely continuous, 
and general functions y. 


DEFINITIONS 1.65. We shall introduce a chain of properties between the 
Vitali u-property and the Vitali property for non-negative Radon y-integrals, 
under the assumption that (G,) holds. We let » and g denote two numbers, 
both greater than 1, for which p~'! + q-' = 1. By uw-functions we shall mean 
those Radon u-integrals ¥ of the form ¥(M) = S uf (x) du, for M € M, where 


f is such a function that for any given positive integer n, J uel f (x) |"du is (uni- 


formly) bounded on the M-sets M included in G°,; by 2-functions we shall 
mean those functions f which are integrands of u-functions. We shall say 
that B is an S®-basis if and only if for each subset X C E of finite outer 


‘In case B is a blanket, the Vitali property for classical Radon measures is the “‘pseudo- 
strength” of (10), which also is referred to as ‘‘Vitalische-Hayes’sche Eigenschaft” in 
(21, p. 91). The existence p.p. of the derivative of any classical Radon measure is 
established in (10). 
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measure, each B-fine covering V of X, and each « > 0, there exists an M- 
family E of V-sets for which, putting cE = S, 

(1) E is an 0-covering of X; 

(II) the u-overflow of E with respect to X is less than e; 

(III) fs{ex(x)}?du < € (the 2-overlap of E is less than e). 

Statements (I), (II), and (III) are meaningful for » = 1; we accordingly 
define an S“-basis as one having properties (I), (II), and (III), with p = 1. 
We define as u”-functions all integrals of u-measurable functions which are 
essentially bounded on each set G°®,. 


Remarks. Comparison with Definitions 1.33 shows that for any p > 1, 
(1) and (II) are the same as (V1) and (V2), while (III) is at least as strong as 
(V3) for ¥ = uw; hence every S®-basis, p > 1, possesses the Vitali u-property, 
and, in accordance with the Remarks following Definition 1.47, differentiates 
the »”-functions. The following is an extension of this result. 


THEOREM 1.66. If p > 1 and B is an S®-basis, then B differentiates the u- 
functions. 

Proof. We let B denote any S®-basis. From the property (G,), it follows 
that we may restrict our proof to the case where the domain E of B lies within 
one set G°y; that is, E may be assumed to be bounded. Furthermore, it follows 
from the remarks just above, and from Theorem 1.45, that we need prove only 
that for each non-negative u-function y, defined by ¥(M) = Sad (x)-du for 
M € M, B possesses the Vitali y-property. 

Accordingly, we let X denote any subset of E (necessarily of finite outer 
measure), V any B-fine covering of X, and ¢ any positive number. We put 
G°y = G and define ¢’ as any positive number such that 


1.661 cel J ven . an) ; <«. 


From the u-absolute continuity of y on the M-subsets of G, it follows that there 
exists a positive number 7 for which 

1.662 ly(M’) — ¥(M”")| < « 

whenever »(M’ — M"”’) < 4, where M’ € M, M” € M, M’CG, M” CG, 
and M’ — M” denotes Stone’s difference. We may and do assume that 7 < ¢’. 


We may assume V to be G-pruned. We use the S®-properties of B to find 
an M-family E of V-sets for which, putting S = cE C G, we have 


1.663 X-X-SEN': u(S — S-X) <7; J leeGey?- du <a, 


Evidently E satisfies (V1) of Definition 1.33. From the first relation in 


1.663 and Proposition 1.48, we see that S C G (mod N*) and X¥ C G (mod N*); 
hence, because of 1.662, 


lv(S) — ¥(X-S)| < «¢; 
therefore ¥(S — S-X) < «, and (V2) holds. 
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Using Hdélder’s inequality, 1.661, and the last relation in 1.663, we have 


J cater av = f x(x) fle) du 


<(f teeteriran) "( f tcenitan) 
< n( fjseorran) <«. 


Hence (V3) holds, and the proof is complete. 


Remark. In (11), there is given an example of an S®)-basis (p > 1) and a 
function which is a u“”-function for each gq’, g’ < p/(p — 1), whose derivative 
is infinite everywhere. In this example, as in all counter-examples known to us 
in the theory of differentiation, a derivate is infinite on a set of positive meas- 
ure. In this connection it is interesting to observe that Zygmund’s proof (29) 
depends upon the summability of the derivates, which prevents a ‘‘flight to 
infinity”’ on a set of positive measure. 


THEOREM 1.67. In the definition of an S®)-basis, the 0-covering condition 


may be replaced by an «-covering condition; simultaneously E may be required 
to be finite. 


Proof. Since we are merely relaxing the initial definition of an S®-basis, 
we have to prove only that any S®-basis under the e-covering definition is an 
S®)-basis under the 0-covering definition. We thus assume that for any subset 
X CE of finite outer measure, any B-fine covering V of X, and any « > 0, 
there exists a finite family F of V-constituents such that 


1.671 aX —X- oF) < €; w(oF — xX: oF) < «; f fep(x)}"du < «. 
oF 


We take a subset X of E of finite outer measure, a B-fine covering of X, 
and a positive number ¢«. We choose a sequence of positive numbers, 1, 


12, - ++» +--+ Whose sum is less than e. 
We shall determine inductively a sequence of finite families F,, F.,..., 
F,, ... of V-constituents, such that, for » = 1,2,...: 


(a) F.CF.C...CF,; 
(b) u(X = X - oF,) < Mn; 


(c) u(oF, — X-oF,) < aT = $3 


(a) jf. fer, (x) "du < ET ~s 


The existence of a family F, satisfying (b), (c), and (d), for m = 1, follows 
from our hypotheses as expressed in 1.671. We assume the existence of a 
nested sequence of families F,, Fo, ... F,, satisfying (a), (b), (c), and (d), 
and proceed to find F,,4; also satisfying them. 
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We put oF, = S, X — X-S = Y; then Y = X — X:S is a w-cover for Y. 
From (d) and the fact that u(S) < © it follows that 


J tor) Pan < @; 


thus we may find a positive number y = (7,41) such that 


1.672 J tor. cerrau < tani /? 


whenever M is an M-set, M CS, and yu(M) < y. We may and do assume 
that 7 < Nn+1/ 2’: 


Again recalling 1.671, we find a finite subfamily H of V for which, putting 
oH = T, 


1.673 w(¥ —¥-T) <7; u(T-—Y-T) <7; f {en(x)}’du < +. 
T 


Noting that S-T C T — Y-T, using 1.672, and the second relation of 1.673, 
we obtain 
1.674 J, bern < ness. 

We define Fu = F, U H and let oF 41 = U.We observe that X — X-U = 
Y — Y-T and (U — X-U) C (S — X-S) + (T — Y-T), whence from 1.673 
and (c), we obtain 

n+1 


u(X —X-U) << mi WU -X-U) <i ty <het mn = 2 1% 
= 


which establishes (b) and (c) as applied to F,,4:. 
Next, 


1675 f fen nGdu= ff fered dn + ff fer...(e) Pau. 
U U-T T 
Since U — T CS, and €yn41 = €x, on U — T, then by (d), 
1.676 f fer. 4,(x)}"du < f fer, (x) }’du < fn. 
U-T Ss 


Using 1.673, 1.674, and Minkowski’s inequality, we also obtain 


J fer@rdn =f torsCe) + dul) — 11%4u 


< [ (f toe.ce)yran)” +(f tone) - vax)” [ 
. [(f torceriran)” + (f tenterr’an)” | < wet 


Putting this last inequality and 1.676 into 1.675, we establish (d) as applied 
to F441. 
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Finally, we define 


It is clear from (a), (b), (c), and (d) that E is an M-family of V-constituents 
satisfying conditions (1), (I1), and (III) of Definitions 1.65, which completes 
our proof. 


Remarks. No change in the definition of an S®-basis results if X is required 
to be a bounded subset of E. For then any subset Y C E of finite outer meas- 
ure may be decomposed into a countable sum of bounded disjoint subsets of E, 
for each of which the S®-property holds, and by the method of the preceding 
theorem a countable family F of V-sets may be found which is an 0-covering 
of Y, with 2®-overlap and (u, Y)-overflow of F both as small as desired. 

If, in the proof of Theorem 1.66, we adopt the e¢-covering version for the 
definition of S®-bases, the first relation in 1.663 is replaced by u(X—X-S) < «. 
The Vitali ¥-property is established in the ¢-covering version. 


1.7. Relation to Younovitch’s differentiation theorem. Younovitch 
(28, Theorem III) assumes that E = R, uw is complete, and y»(R) is finite. 
The sets consisting of a single point belong to M. The basis B is a special 
de Possel basis such that each point x there corresponds to one ordinary 
sequence A ,(x), A2(x),...,A,(x),... of “neighborhoods” such that x € A, (x). 
The x-converging sequences are the subsequences of the basic sequence A;(x), 


A2(x), eees A, (x), eeee 


YOUNOVITCH’S DIFFERENTIATION THEOREM 1.71. JB differentiates the 
(finite) u-integrals 1f there exists a positive constant a such that: 


(Y1) Corresponding to any set M in M of positive u-measure, any B-fine 
covering V of M, and any posiive ¢, there exists a finite disjoint subfamily V,, 
Vo,..., Vn, satisfying the inequalities 


1.711 > u(Vi — Vir M) <6, (ar. > v.) > au(M). 
i=1 i=1 
In addition, each set V;,, i = 1,2,...,m, must belong to the basic sequence 
corresponding to a point p, of M; that is, for a certain index n,, 
V,= An (p:). 


The second inequality of 1.711 may be expressed by saying that the ex- 
haustion power of the V, with respect to M is greater than a. 


Remarks. If ~ denotes a non-negative y-integral, (Y1) holds, and M, V, 
and ¢ are regarded as prescribed, then we can, by suitable finite iteration 
of the exhaustion process under y-overflow control, produce a finite system 
91, --+»Q Of points of M and indices m;,..., m, such that 
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(1) the u-defect of covering of M by the sets 
Ai = An, (q;) 


is less than e; 

(2) the ~-excess of covering of M by the sets A? is less than e; 

(3) the y-overlap of the sets A? is less than «. 

Consequently, (Y1) implies the Vitali property for integrals, (i) expressed in 
the «-covering version, (ii) with reference to anchorage points, (iii) restricted 
to M-measurable sets. As noted previously, the e-covering version with finite 
M-family E is equivalent to our original one. Actually, by infinite iteration of 
the exhaustion process outlined above, we can obtain an (enumerable) 
M-family satisfying (2) and (3), and covering M (mod N). 

Younovitch, like de Possel, formulates Vitali-covering properties with 
reference to points, thus strengthening the Vitali assumptions. Condition 
(iii), however, appears to be substantially weaker than the Vitali property 
for u-integrals. If, by any means, one can prove under Younovitch’s assump- 
tions that the (extreme) derivates are u-measurable (see Remark after Theo- 
rem 1.46), or that any B-fine covering of a set X is a B-fine covering of any 
measure cover for X, then Younovitch’s theorem follows from Theorem 1.62. 
Younovitch’s note, unfortunately, contains neither proofs nor even a hint of 
them. 


§2. THe CONVERSE PROBLEM; COVERING PROPERTIES DEDUCED FROM 
DIFFERENTIATION PROPERTIES OF o-ADDITIVE SET FUNCTIONS 


2.1. De Possel’s equivalence theorem. (22, pp. 403-405). 


DEFINITION 2.11. A derivation basis B possesses the density property if it 
differentiates the integrals of the characteristic functions of yu-measurable 
sets, that is, if, for any M-set M, the density of x, defined as the limit of 
u(M,(x)-M)/u(M,(x)), exists and equals Cy (characteristic function of M) 
almost everywhere on E. 


THEOREM 2.12. The density property and the Vitali u-property are equivalent. 


Proof. As noted in the Remarks after Definition 1.47, the Vitali u-property 
implies the density property. We have to prove the converse. 

We assume that the density property holds and let X denote a subset of 
E of finite outer measure, V a B-fine covering of X, and «€ a positive number. 
We select a, 0 < a < 1, so that 


2.121 0 < (a! — 1) w(X) <. 


If Y is any subset of X such that a(Y) > 0, then we define V(Y, a) as the 
family of V-sets V for which 


2.122 u(Y-V) > ap(V), 


and yy as the supremum of the numbers »u(V), for V € V(Y, a). 
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Now, from the density property, if @(Y) > 0, it follows that the density 
of Y equals 1 for at least one point y € Y. Hence there exists at least one 
y-converging sequence in B whose constituents belong to V and satisfy 2.122. 
The family V(Y, a) is thus non-vacuous, hence wy > 0. In case Y C X and 
a(Y) = 0, we put py = 0. 

We fix a number «x, 0 < « < 1. By the definition of uy, there exists a V-set 
Vi, such that 


w(Vi) > kux, w(X-Vi) > ap(Vi). 


We let X; = X, X_ = X; — Vi-X,. From this point we proceed inductively, 


assuming that sets V, have been defined in V for i = 1, 2,...,, satisfying 
the relations 

2.123 u(XeV)) > ap(V,), u(V,) > Kixi, 

where 


‘ 
Xr = Xi — Xi U V;. 


j=l 
In case @(X,4:1) = 0, we stop the process; in case 7(X,4:) > 0, we define a 
new V-constituent V,4; such that 
(Vert) > KMxse.s B(Xnsi* Vers) > ap(V,). 

The process just described leads to the construction of an M-family E 
consisting of a finite or infinite sequence of sets V; taken from V, satisfying 
2.123 (¢ = 1,2,...). Since also the sets X,-V, are disjoint (mod N*), we 
have 

p(X) > A(x. U v.) > | U X,: v.) 
i i 
= Duke V;) > a> u(V;); 
t 
consequently, 


2.124 > u(V,) < o*,(z. U v.). 


Putting S = cE and combining 2.121 with 2.124, we obtain 


2.125 {> a) - us) +u(S— S-X) = 2d (Vs) — u(S-X) 
< (a - 1) u(S-X) < €. 


Hence conditions (V2) and (V3) of Definition 1.33, with y = yu, hold. 

To show that (V1) holds for our family E, we note that if the sequence of 
sets V, is finite, then for some positive integer N, we have E = 
{Vi, Vo,..., Vw} and u(Xwy41) = 0. Thus a(X — X-cE) = 0, as required by 
(V1). If the sequence is infinite, then from 2.123 and 2.124 we see that 


«>, bx, < > #(V,) < au(X) < @; hence lim ux, = 0. 
t t 
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We let X,, = X — X-U V;,. Since X,, C X, and hence V(X,,, a) C V(X,, a), 
fors = 1,2,..., then 


MX¥a2 = 0. 


This means that cE = U V; > X (mod N*), as required, and the proof is 
complete. 


Remarks. lf we wish, as does de Possel, to ‘‘anchor”’ the V, to points of X, 
we can extract V, from an x,-converging sequence, whose constituents belong 
to V(X,, a), and such that x, € X,. 

As is known (25, p. 129), in any Euclidean space the interval basis possesses 
the density property, therefore, by Theorem 2.12, it is a weak derivation 
basis. There exists (17) an example of a summable function f in the plane 
whose indefinite integral is J-differentiable (strongly differentiable), although 
the integral of |f| is not. 


2.2 A necessary and sufficient condition for a weak derivation basis to 
differentiate a u-finite M-measure (Radon measure) ¥. We assume that B 
is a weak derivation basis; that is, B possesses the Vitali u-property. 

We let f denote a y-measurable, non-negative, almost everywhere finite 
function with domain R; by f, we shall mean that function for which f,(x) = 0 
if f(x) > m and f,(x) = f(x) if f(x) <_m. We further define r,(x) = f(x) — f,(x), 
and for M € M, 


y(M) = J Je) au: ta(M) = J fate) dus pn(M) = J rae) dy 


Since f, is a u-measurable bounded function, B differentiates its integral 
¥,; that is, Dy, exists almost everywhere in E and equals f,(mod N*). We 
have 

D*py = Db, + D* pn, 
hence 
D*y = f, + D*p, (mod N*) 


on E. In accordance with the definition of f, and the finiteness (mod N%*), 
we have lim f, = f almost everywhere on E. This leads to the following 
result. 


LEMMA 2.21. A necessary and sufficient condition for a weak derivation 
basis to differentiate p is 
lim [D*p, > 0] = 0. 


In particular, if p(E) is finite, this is equivalent to the condition that 
lim g[D*p, > «] = 0 for each positive «. 


CorOLLARY 2.22. If B differentiates the non-negative y-integral yp, it differ- 
entiates any M-measure y’' for which y’ < y. 





ie | 


ut 
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This result can be extended to any u-finite (resp., Radon) M-measure y. 
In fact, for M € M, 


¥(M) = ¥.(M) + J Je) a, 


where f represents the Radon-Nikodym integrand of ¥. We suppose that 
vy’ is any M-measure, ¥ < y. Then 


V(M) = v.(M) + fre du 


is the corresponding decomposition for y’. 

If No denotes an N-set for which ¥,(R — N») = 0, then y is u-absolutely 
continuous on the M-subsets of R — No, consequently, so is ¥’; therefore 
v’.(R — No) = 0, 


V'(M) = V(M-(R — No)) < ¥(M-(R — No)) = (MN), 
V(M) = WV (M-No) < ¥(M-No) = ¥,(M). 
Thus y’, and y’, are dominated by y, and y,, respectively. The assumption 
that B differentiates y means that Dy = f (mod N*) on E, hence Dy, = f 
(mod N*) on E, and Dy, = 0 (mod N*) on E. Since D*y’, < D*y,, then 
Dy’, exists and equals zero (mod N*) on E. Thus, we have the following 
general result. 


THEOREM 2.23. If a weak derivation basis B differentiates the u-finite (resp., 
Radon) M-measure y, then B differentiates any y-finite (resp., Radon) M- 
measure dominated by yp. 


COROLLARY 2.24. If a weak derivation basis differentiates the total variation 
t of a signed :-finite (Radon) M-measure yy, it differentiates y itself. 
As a special case, if the weak basis B differentiates the integral Sif (x) |du, 
where f is a o-bounded measurable function, then B differentiates Sf (x)du. 
LemMA 2.25. If the weak derivation basis B differentiates the u-finite Radon 
M-measure ¥, M € M, and tr = ¥ + u, then the r-density 
. 7(M- M,(x)) 
lim rz) 
exists almost everywhere on E and equals Cy (characteristic function of M). 


Proof. We let f denote the Radon-Nikodym integrand of y. Then, for 
M’€ M, 


tu(M"') = r(M’- M) 


¥(M’- M) + »(M’- M) 


ff) du + ¥.(M’- M) + »(M’- M) 
- f..0@) + 1) du + ¥,(M’- M) 


= J cucterre) +1) du + ¥,(M’- M), 
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where y, is the u-singular part of y. Since B differentiates r, lim r(M,(x))/ 
u(M,(x)) exists and equals f(x) + 1 for u*-almost all x € E. But B also 
differentiates ry, so that 

lim r(M,(x) - M)/u(M,(x)) 


exists and, by above, is equal to Cy(x) (f(x) + 1) for u*-almost all x € E. 
Hence, by division, lim 7r(M,(x)-M)/r(M,(x)) exists and equals Cy for 
p*-almostallx € E. 


Lema 2.26. Jf B, y, and r are as in the preceding lemma, X is a subset of 
E of finite outer measure, M, is a measure-cover of X, V is a B-fine covering of 
X, € is a positive number, and 0 < a < 1, then there exists a finite or infinite 
sequence of V-sets V, for which 


2.261 U Vv, > X(mod N*), © (Ve) < r(at- U Vx) /a. 


Proof. For any M-set such that 7(M-E) > 0, we define V(r, M, a), as the 
family of V-sets V for which 


2.262 T(M-V) > ar(V), 


and u(r, M) as the supremum of the numbers u(V) for V € V (r, M,a). 
From Lemma 2.25, it follows that there is at least one point x € M-E at which 
the r-density of M equals 1, hence V(r, M, a) is non-vacuous, and u(r, M) > 0. 
In case M € M and ga(M-E) = 0, we define u(r, M) = 0. 

From this point on the proof follows closely that of Theorem 2.12, with + 
replacing » and the measure covers having to be specially selected, since r 
need not be u-absolutely continuous. By a process similar to that of Theorem 
2.12, for fixed x, 0 <« <1, we determine inductively a finite or infinite 
sequence V;, V2,... of V-sets with properties as follows. We put X; = X, 
and for any positive integer m > 1, 


n 
Xn+1 = X, — X\° U Vi, 
t=1 
M, +: denotes a measure cover of X,,4; contained in 


M, — M,° U Vi. 


t= 


If @(Xn41) > 0, then V,4: is so chosen from V that 


2.263 7 (Mais Vari) > ar(Va+1), u(Vn+1) > u(r, My+1). 


If @(Xa41) = 0, the process stops. 
Our choice of the sets M, ensures that the sets M,-V, are strictly disjoint; 
hence, using 2.263, we have 


2.264 1(Mi) > r(Mh- U Vs) > r( U mM, Vx) 
= > 7(M,- Vn) > ad (Vz), 


which is the second relation of 2.261. 
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If V, is a finite sequence, then 7(X,) = 0 holds for some integer N, and the 
first relation of 2.261 clearly holds. If V, is infinite, we let X,,. = X — X-U V,; 
we may, and do, choose a y-cover M,, of X., contained in (\ M,. From 2.263 
and 2.264 we have lim u(r, M,) = 0. Since M,, C M,, we have V(r, M., a) C 
Vir, M,, a), and u(r, M..) < u(r, M,) for m = 1,2,...; thus u(r, M.) = 0, 
and a(M,,-E) = 0. But X,, C M,-E; hence a(X..) = 0, and the first condition 
of 2.261 holds. 


THEOREM 2.27. If a weak derivation basis B differentiates the u-finite (Radon) 
M-measure y, then B possesses the Vitali ~-property. 


Proof. Taking X, M,, V, and «¢ as in the statement of Lemma 2.26, we 
select a so that 


2.271 0 < (a' — 1) r(M)) < «, 


and choose an M-family E in accordance with Lemma 2.26, satisfying 2.261. 
For S = cE, the (7, M,;)-redundancy of covering is given by 


jx 1(V_) — (5) + 1(S — S+ My) = S1(Vs) — r(S* Mi), 


which, by 2.261 and 2.271, is less than «. Thus the r-overlap of E and the 
(r, M,)-overflow of E are less than e, so that the Vitali r-property holds. 
Since y < 1, the Vitali y-property also holds. 


Remark. If desired, the sets V, may be “anchored”’ to points of X,, as in 
the de Possel theorem. 

Combining Theorem 2.27 and Theorem 1.52, we obtain the following 
criterion of differentiability of an individual M-measure. 


THEOREM 2.28. A necessary and sufficient condition for a weak derivation 
basis B to differentiate the u-finite (Radon) M-measure y is the validity of the 
Vitali y-property. 


THEOREM 2.29. The Vitali property for u-finite (resp., Radon) y-integrals is 
equivalent to the B-differentiability of every u-finite (resp., Radon) yu-integral; 
the Vitali property for w-finite (resp., Radon) M-measures is equivalent to the 
B-differentiability of every u-finite (resp., Radon) M-measure. 


Proof. This follows from Theorem 1.52, Theorem 2.12, and Theorem 
2. 


2.3. Relation to Younovitch’s equivalence theorem. We return to the setting 
of 1.7 in the following discussion. 

Younovitch formulates a Vitali u-property (Y2) exhibiting the features 
(i), (ii), and (iii) in the Remarks under 1.71, and asserts its equivalence with 
the density property. That is, under his assumptions, Younovitch proves 
Theorem 2.12 with a weakened Vitali u-property in which X is required to 
belong to M. We have been unable to prove this; otherwise, Younovitch’s 
theory would be a special instance of ours. 
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Younovitch also formulates a criterion for the differentiation of y-integrals, 
which are necessarily finite since u(R) < © for his space R. This will be 
stated after some preliminary definitions are given. 

If D is a decomposition of the space R into a sequence of disjoint u-measur- 
able sets R:, Re, ...,s0 that R = U R,, then D is Y-summable if and only if 


> vp(R,) 


is finite. For any positive integer k and any positive number e, U,(®, &, €) 
denotes the set of points x in R for which there exists a sequence 
A,, (x) 


satisfying the relation 
DL m(R,* An(x)) > (An (2)). 


Younovitch’s basis B is said to have the property (Y3) if and only if for each 
Y-summable decomposition D of R and each positive number e, 


lim {U,(®, k, «)} = 0. 


(Younovitch does not place a bar over yu, evidently regarding the bracketed 
set as u-measurable, which seems to confirm the conjecture of 1.71 that he 
establishes the u(=,*)-measurability of the derivatives of u-integrals). 

YOUNOVITCH’S CRITERION 2.31. (Y2) and (Y3) together are equivalent to 
the B-differentiability of every (finite) u-integral. 

Assuming (Y2) to be equivalent to the Vitali u-property, this result can be 
proved from the theorems of 2.1, as will now be indicated. Corresponding to 
the decomposition occurring in the formulation of (Y3), we define a function 


@ by o(x) = vy for x € R,,v = 1,2,...; @ is the frequency function of the 
M-family C consisting of the sets 
S. = U R,z 


t=» 
C is a covering of R. The Y-summability of the decomposition means the 
finite integrability of ¢ = ¢c¢ over R; conversely, any u-measurable function 
on R taking only positive integral values may be regarded as the frequency 
function of a measurable M-family covering R. 
We apply the considerations of 2.2 to f = ¢. We have 


> yu(R,~ An.(x)) = pe(An,(x)). 
Younovitch’s set U,(D, k, €) satisfies the relations 


[D*» > «] D U,(D, k, €) D [D*» > l. 


From Lemma 2.21, assuming the Vitali u-property, it follows that condition 
(Y3) is equivalent to D*y = Dey = f (mod N*) for the integrals of measurable 
frequency (or multiplicity) functions f. 
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By Theorem 2.12, the Vitali u-property is equivalent to the differentiability 
property when the measurable function f takes only a finite number of values, 
or even only the values 0 and 1. These latter functions are the characteristic 
functions of u-measurable sets. 

Combining, we see that Younovitch’s theorem amounts to the assertion 
that a necessary and sufficient condition for the validity of the differentiability 
of u-integrals is its validity for the integrals of u-measurable functions taking 
only positive integral values. 

It remains to be shown only that a weak derivation basis differentiating 
the integrals of u-measurable frequency functions differentiates all yu-integrals 
¥(M) = S af (x) dy. This follows using the decomposition ¥(M) = ¥*(M) + 
¥-(M), and for non-negative f, the representation f(x) = [n](x) + e(x), 
where [m](x) takes only positive integral values and —1 < e(x) < 0. Any 
weak derivation basis differentiates the integrals of the functions e. 


2.4. A converse theorem for bases differentiating the »“°-functions. In 
what follows, we assume that the basis B is a general derivation basis and 
that R has the property (G,). 

E denoting an M-family of B-constituents, r and a positive numbers, we 
let E(a, r) denote the family of E-sets V for which 


J ee(e)}"du > au(¥), 


and we further let o,,,(E) denote the union of the sets E(a,r). Clearly, if 


r’ >r”’, then E(a, r’) D E(a, r’’), and oa, (E) D oa,(E). 


LemMA 2.41. Jf H represents the M-family of the E-constituents V for 
which 


J tee Ge)1"du < au(V); 
that is, if H = EB — E(a, 1’), then 
wo" (H) < a> u(V), 
where w‘’t (H) denotes the 0°*-overlap of H. 
Proof. 
oH) =f fenle)) du = ff fouls) — 1d 
< J tee) — 1}"ou(x) du 


» J tee(eyi"du < a> u(V). 


In the preceding considerations, if r = 0, we shall interpret {¢2(x) — 1}’ 
as the function defined on cE, taking the value 0 if eg(x) = 0, or the value 
1 if eg(x) > 1; that is, the restriction to cE of the characteristic function 
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Cex of the E-overlap set 6E (see Definitions 1.32). E(a, 0) is the family of the 
E-sets V for which u(V-0E) > apz(V). 
The above lemma remains true when r = 0, since 


oH) = oH) = ff fonts) — 1) de 
< J Con (x) n(x) m< > J Crete) du < ad n(V). 


DEFINITION 2.42. We say that the basis B has the property (H,), for 
p > 1, if and only if for any bounded set X C E, any B-fine covering V of X, 
any 2* > a@(X), any «* > 0, and any a* > 0, there exists a finite M-family 
G of V-sets such that 


2.421 g(X —X-0G) <e; >» u(V) <2"; w(oa*, p-1(G)) < . 


Remarks. Without the third condition, we have the Vitali u-property in 
the ¢-version, and for bounded subsets of E, which, as noted earlier, is equiva- 
lent to the original definition of the Vitali u-property. 

We see that if p’ > p”, then (H,-) implies (H,--). 


Lemma 2.43. If B is an S®-basis, z > 1, and if B does not possess the pro- 
perty (H,-), where p’ > 1, then there exists a bounded set X_ C E, a bounded 
G-set Go > Xo, a B-fine covering Vo of Xo, and positive numbers ao, €9 such that 
for every M-family F of Vo-sets satisfying the relations 


2.431 g(Xo — Xo- oF) <@, wloF — Xo- oF) < eo, w”(F) <¢ oF CG, 
we have 
B{Ga,,9'-1(F)} > 2eo. 


Proof. If G is a bounded G-set, X C E-G, V is a B-fine covering of X, 
a and ¢ are both positive numbers, then we call the entity (X, G, V, a, «) 
an admissible quintuple. Since B is an S“-basis, for any such quintuple there 
exist M-familes F of V-sets for which a(X — X-cF) < ¢, u(oF — X-cF) < «, 
w(F) < ¢€ and oF C G. For such families F we thus have 


D> u(V) = w(oF) + w(F, uv) < w(oF) + wo (F) < a(X) + 2. 


For any fixed admissible quintuple, we let 7 denote the infimum, among all 
such families F, of the numbers yu(e.,)—-1(F)). It follows that if, for each 
admissible quintuple, the corresponding » were zero, then B would have the 
property (H,,), contrary to hypothesis. Thus, for some admissible quintuple 
(Xo, Go, Vo, ao, €0), the corresponding mo is a positive number, and for each 
finite M-family F of Vo-sets satisfying the relations 2.431, we have 


2.532 Boa,9-1(F)} > m0 > 0. 


Now if F is any M-family which satisfies the relations obtained from 2.431 
merely by replacing ¢) by any smaller positive number, then F necessarily 
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satisfies the unchanged relation 2.431. Thus, we may assume that ¢9 has been 
chosen so smail that 0 < €9 < 40, which, in the light of 2.432, completes the 
proof. 


Henceforth, any quintuple (Xo, Go, Vo, ao, €o) satisfying the conditions of 
Lemma 2.43 will be called a privileged quintuple. 


Lemma 2.44. If the basis B possesses the property (H,), where p > 1, then 
B is an S®)-basis. 


Proof. By virtue of the Remarks following Theorem 1.67, it suffices to 
show that the S®-properties hold when X is any bounded subset of Z. Thus, 
we take a bounded set X C E, a B-fine covering V of X, choose « > 0, and 
select 2* so that a(X) < 2* < g(X) + 4e. We let «* = }e and choose any 
positive a* with a*z* < «. 

We use the property (H,) to find a finite M-family G of V-constituents 
such that 2.421 holds. We define F as the family of G-sets V for which 


J toi an < at u(V), 


If Y = X-eG, then F covers Y — Y-o,*,-:(G). Hence, by 2.421, and the 
definition of e*, we have 


2.441 a(X —X-oF) < a(X — X-oG) + w(oatp1(G)) < 2€" = €/2; 
that is, F is an e-covering of X. 


Using Lemma 2.41, with r = » — 1, and taking account of the second 
relation in 2.421 and the choice of a*, we obtain 


wo” (F) <a® > w(V) <a*® > w(V) <a’s® <. 
ver VG 
Finally, from conditions 2.421 we have 
u(oF — X- oF) < w(oG — X- oG) = u(X — X- oG) + w(oG) — u(X) 
< u(X — X- 0G) + JF uv) - u(X)} << 


which completes the proof that B is an S®-basis. 


DEFINITION 2.45. We assume R to be a measure space as described in 1.1. 
We suppose that U is a given family of M-sets of finite positive measure, 
5 a positive finite function defined on U. We define E as the set of points x 
for which there exists at least one ordinary sequence of sets V, € U with lim 
5(V,) = 0 and x € V, for nm = 1,2,.... We define the D-basis [U, 5] by 
associating with each x € E the totality of ordinary sequences of sets M ,(x) 
(c = 1,2,...) for which x € M,(x), M,(x) € U, and lim 6(M,(x)) = 0. From 
our assumptions it follows that the domain of [U, 4] is Z and the spread is a 
subfamily of U. The function 6 is called the index of uniform contraction. 
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For any fixed » > 0, we denote by V, the subfamily of V consisting of the 
V-sets V for which 6(V) < 7. 

For a family (or an M-family) F of V-sets V, we define the 4-fineness 
or 6-norm v(F) as sup 46(V), for V € V. 


Remarks. Denjoy (2) considered bases more general than those just 
defined, insofar as the requirement “x € M,(x) for each «” is replaced by 
“‘e € E(M,) for each 1,” where E(V) is defined for each B-constituent V as a 
subset of R, not necessarily u-measurable, containing V. However, the con- 
traction requirement is lim u4(M,) = 0 (19). Nevertheless, we name the bases 
introduced here after Denjoy since his memoir points to their specific pro- 
perties. Following Haupt, they are called “‘U-Basen”’ in (21, p. 71), for reasons 
there explained. 

Once 64 is fixed, a D-subbasis of [U, 6] is uniquely defined by its spread 
T CU; its domain D[T, 4] (abbreviated D[T]) is no longer an arbitrary 
subset of D[U] as is the case with a general subbasis. For instance, if B is an 
S®-basis, the domain of any D-subbasis of B is a u*-measurable set. 

A B-fine covering V of X is characterized as a subfamily of U with D[V]D>X 
(mod N*). 


Lemma 2.46. If B is a D-basis [U, 6], p’ > 2 > 1, and B is an S®-basis 
but not an S®’)-basis, then there exists a bounded G-set Go and a set Xo, with 
Go-E D Xo, a B-fine covering Vo of Xo, positive numbers ao and &, and a se- 
quence, F,, F2,..., F,,... of finite M-families of Vo-sets for which 


2.461 lim »(F,) = 0, S, = oF, C Go; 
2.462 p(Xo—Xo° S,) <<, w(F,) < €0/2"*", pw (oa,.p°-1(Fa) > 2€0. 


Proof. By Lemma 2.44, B does not possess the property (H,’), and we may 
apply Lemma 2.43 t find a privileged quintuple (Xo, Go, Vo, ao, €0). Since B 
is an S‘-basis, we define F, as a finite M-family of V1,,-sets included in Go, 
satisfying the first two relations in 2.462. The last relation in 2.462 holds by 
our choice of a privileged quintuple, and 2.461 is clearly valid. 


Lemma 2.47. We let B denote an S‘-basis which is also a D-basis [U, 8]. 
We define po as the supremum of numbers p such that B is an S®)-basis; we 
assume that po < @. We so define qo that po! + qo“ = 1 tf fo > 1; otherwise 
go = © if po = 1. Then for any number q, 1 <q < qo, there exists a p™- 
function Yo, a positive number ao, and a subset Cy of E of positive outer measure 
such that: 


2.471 ¥o(M) = J foe) du for M € M, and nok: < ©; 
M R 

2.472 fo(x) = O foreach x € Co; 

2.473 Dv > a > Oforeachx € Cp. 
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Proof. We so define p that p-' + g-' = 1. Our hypotheses on g ensure that 
Po < p < @. In case po > 1, we clearly have 
0 < g(Po — 1) < go(Ho — 1) = po; 
hence we can choose a number p’ so that 
0<g(p'-—1)<p<?p' <P. 
Even in case fp = 1, this last inequality may be satisfied for a suitable choice 
. * either case, we so define g’ that (p’)-' + (g’)~' = 1; clearly, then, 


q <q <q. We let z denote the larger of the two numbers g(p’ — 1) and 
a 


From our assumptions, it follows that B is an S“-basis but not an S®- 
basis. Lemma 2.46 asserts the existence of a privileged quintuple (Xo, Go, Vo, 


ao, €9) and a sequence F,, F2,... F,, . . . of finite M-families of Vo-sets satisfy- 
ing 2.461 and 2.462. We let 


S, = oF,, O, = OF,(F,-overlap set), D = U On, 
HA, = Ca,.p'-1(Fn), On = H, —H,-D, Co = limsup Q,, 
€n(X) = €pn(x) if x € On, €n(x) = 0, x ¢0,. 
We have 
wD) < 2 wn) < Lor) < Dw (F) < 
Since u(H,) > 2 €o for each n, then u(Q,) > €o for all m. Since the sets Q, are 


subsets of the set Go of finite outer measure, then u(Co) > €. 
We define 


DAT — fal) = D lela", voll) = f fale) de M éM. 


Now fo is non-negative and vanishes on R — D, hence on Cy C (R — D); 
this verifies 2.472. 
From the definition of fy and Minkowski’s inequality, we have 


( focrran) ‘ x (f tacens”an) < u ( f taceriran) 


1/@ 
= > {fo (F,)}'" < > (<5) vt aa satel 


where p = 2-"/*, Since g > 1, then p < 1, and the sum of the geometrical 
series is finite; thus 2.471 holds. 
We denote by F,(ao, p’ — 1) the family of those sets V € F, for which 


2.475 J teste du > agu(V); 


then H, = oF, (ao, p’ — 1). 











252 C. A. HAYES, JR. AND C. Y. PAUC 


To each point x € Cy C lim sup H, there corresponds a sequence of natural 
numbers m, such that 


x E V(n;), V(m;) € F,, (ao, p’ = 1). 
The sequence of the sets V(m,) is an x-contracting sequence of B, thus x € E, 
and Cy C E. 
From 2.474 and 2.475 it follows that 


¥o( V(m,))/u(V(m,)) > (f. ew cy" dn) /u(V(0,)) > ao 


from which we obtain 2.473. 


THEOREM 2.48. If B is a D-basis which differentiates the u-functions, where 
1<q< ©, and if p is so defined that p-' + q—' = 1, then B is an S®’)-basis 
for each number p' such that 1 < p’ < p. 


Proof. Since q > 1, and B differentiates the u1®-functions, then B must 
differentiate the u@-functions, that is, the integrals of u-measurable functions 
which are bounded on each set G®,. By Theorem 2.12, B is an S“-basis. 
Next, we define > and go as in Lemma 2.47. In case pp = ©, it is clear that B 
is an S®”-basis and the theorem holds. In case 1 < py < ©, Lemma 2.47 
tells us that for each number gq’ such that 1 < q’ < go, there exists at least 
one »“")-function which B fails to differentiate. Our hypotheses thus compel us 


to conclude that g > go, hence p < po, from which the statement of the theo- 
rem is seen to be true. 


Remarks. Theorem 2.48 is not a clear-cut converse theorem because it 
does not say that B is an S®-basis. We conjecture that a D-basis can be 
constructed, which is an S®”-basis for each p’ < p, yet fails to be an S®- 
basis. 

According to Zygmund (29, pp. 143-144), the interval basis in the plane 
differentiates the u“-functions for g > 1. Therefore, by Theorem 2.48, it is 
an S®)-basis for all finite p > 1. There exists a blanket (10, pp. 294-295) 
which is an S“-basis but is not an S®-basis for any p > 1. The two bases 
just mentioned are extreme cases in the continuous chain of S®-bases. 


2.5 An individual differentiability criterion of Busemann-Feller type. 


Throughout this section we assume that B is a D-basis and that (G,) holds. 


DEFINITION 2.51. For M € M, we denote by M the element (soma) of 
M = M/N corresponding to M; that is, the M-coset of N. If Z is any sub- 
family of M, and 3 is its image by mapping M — M, then we define the union 
(mod N) of the Z-sets, namely o*(Z), as any set corresponding to the union 
or join of the 3-elements in M/N, regarded as a complete lattice (26, pp. 378- 
380). The lattice relation is so defined that Z’ > Z” holds if and only if for 
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every Z’ € Z, and Z” € Z”, the relation Z’ > Z” (mod N) holds. The union 
o* (Z), defined modulo N, is thus characterized by the property that every 
M-set which includes each Z-set (mod N) includes o*(Z) (mod N). There 
exist enumerably many Z-sets Z;, Z2,...,Z,,... such that 


U Z, = o*(Z) (mod N). 


If all the Z-sets are included (mod N) in an M-set of finite measure, then 
o*(Z) is any M-set of minimal measure including each Z-set (mod N). 


DEFINITION 2.52. If M denotes a bounded M-set (nucleus), 7 a positive 
number, a a number such that 0 < a < 1, then, similarly to Busemann and 
Feller (1, p. 230), we define the halo o,,,(M) as the union (mod N) of the 
B-constituents V for which u(M-V) > apu(V) and 6(V) < ». 


DEFINITION 2.53. If ¥ is an M-measure, M € M, a > 0, and 7 > 0, then 
we denote by oay.,(M) the union (mod N) of the constituents V for which 
¥(M-V) > an(V) and 6(V) < ». The set oay,,(M) is called a y-halo. 


Remarks. The modification of the Busemann-Feller definition involving 
the strict union is due to the possible non-measurability of the strict union in 
our more general sense. 

The term “halo” was first used by K. O. Househam in his talks in Capetown, 
1950, on A. P. Morse’s differentiation theory, to denote Morse’s set A: 8 
(14, p. 208). We diverge from the colloquial use of the term by permitting 
our halos to have points in common with the nucleus or even to include the 
nucleus. However, all our halo conditions control the proper halo; that is, 
the part of the halo outside the nucleus, thus retaining the basic meaning of the 
term. 

The relation between o,,,(E), defined in 2.4, and the notion just defined may 
be written o.,,(E) = oay,,(M), provided M = 6(E), y is the indefinite integral 
of that function coinciding with {eg}’ on @E, zero elsewhere, and 9 = v(E). 


THEOREM 2.53. If B is a D-basis and an S‘-basis, and ¥ is a non-negative 
u-integral, then a necessary and sufficient condition that B differentiate y is the 
following halo evanescence condition: For any bounded non-increasing sequence 
of M-sets M,, with lim »(M,) = 0, any non-increasing sequence n, of positive 
numbers with lim n, = 0, and any a > 0, we have 


w {lim (¢a,9.2.(M,))} = 9. 


Proof. We first establish the sufficiency. We show that the halo evanescence 
property implies the Vitali ¥-property in the e¢-covering version. We let X 
denote a bounded subset of E, V a B-fine covering of X, ¢ a positive number. 
We shall prove the existence of an M-family E of V-sets which is an ¢-covering 
of X, with y-overflow with respect to X, and y-overlap, both less than «. 
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For a suitable N, we have X C G*y (recall Definitions 1.31). Pruning V if 
necessary, we may assume that all the V-sets lie in G°y. Since B is an S“-basis 
and a D-basis, then corresponding to each positive integer m, there exists an 
M-family E, of V-sets such that if S, = cE, then 


2.531 S, > X (mod N*), u(S,— S,*X) < 2, w(Ey, uw) < 27, »(E,) < 1/n. 
Putting 


On - 6E,., Dz _ U Or, 


k=n 


we have u(O,) < 1/2"*', u(D,) < 1/2". 
Next we define a = «/(f(X) + 1). We denote by H, the family of those 
E,-sets V for which 


2.532 ¥(V-0,) < ap(V). 
We have 
2.533 w(H,, v) 


J euteray< J once) av 
2 V(V- On) ad w(V) < a(u(S,) + (En, »)) 


< a(a(X) +27" +2") < a(a(X) +1) = 


Now E, is an 0-covering of X. The constituents in E, — H, are included in 
Fa¥.1/n(On), (mod N*); therefore H, covers 


X — X- oay,1m(Ox)(mod N’*), 


and consequently 
2.534 oH, > {X — X- o44,1m(Dx)} (mod N’). 


Since the sets D, are all included in the set G°y of finite outer measure, and 
form a non-increasing sequence with lim y»(D,) = 0, we may invoke the halo 
evanescence property to conclude that 


lim u(¢a,9,1(D,)) = 0. 


There exists a positive number 7 such that for any M-set M C G°y, for 
which u(M) < 7, we have ¥(M) < e. We fix a so that 


2.535 #(¢a,¥,1(Da)) < 0, ame < . 


We may and do further assume that 7 < «. Then the family E = H, corres- 
ponding to this index satisfies the e-covering condition due to 2.534 and 2.535. 
Using the second relation in 2.531 and the fact that cE C S, C G°y, we have 
¥(cE — cE-X) < «; that is, the y-overflow of E with respect to X is less than 
e. Finally, the y-overlap of E is less than e by virtue of 2.533. 

We now attend to the proof of the necessity. We consider an arbitrary 
non-increasing sequence of bounded M-sets M;, M2,...,M,... with lim 
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u(M,) = 0, an arbitrary non-increasing positive sequence 4, d2,...,5,... 
with lim 6, = 0, and an arbitrary a > 0. We put 


He = 024.3.(Mz), H= 1 H,. 


Since the halo o2y,3(M) is a non-decreasing function of 4, it follows readily 
that for any pair of positive integers m, », we have 


H Cc Ga,¥.8,(M,); 
hence, for each such », 
2.536 HC Nl oay.s.(M,). 


For each such pair of positive integers m, v, there exist enumerably many 
constituents 


a a 
such that 
2.537 5(Vi.») Son, ¥(Vi.e- M,) > on(Vi,,), 


Sa.» = U Vi, = oa ,s,., (M,) (mod N). 


Corresponding to each point x of the set 
H,=5,.,, 
there exists a sequence of B-constituents W, satisfying the relations 
2.538 x € Wr, 8(Wa) <b, (Wa: M,) > ap(W,). 
We let f denote the integrand of y, so define 7 that 


r(x) =f(x)ifx € M,, 1,(x) = Oifx ¢ M,, 
and let 


po(M) = f role) de, 
for M € M. We deduce from 2.538 that 


2.539 D*p,(x) > a 


for each x € H%*,. 

Since B is an S-basis and differentiates y, then by Theorem 2.23, B 
differentiates the non-negative Radon integrals dominated by y for almost 
all x € E. Hence, from 2.539, 7,(x) >a almost everywhere in H*,. But 
r,(x) = 0 for each x € M,, thus H*, C M, (mod N*), which, by 2.537, 
means that 


N oa.v.%.(M,) C M,(mod N°); 


using 2.536, we observe that H C M, (mod N*). Since » is arbitrary and lim 
u(M,) = 0, we conclude that u(H) = 0, which completes the proof. 
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§3. EXAMPLES OF BASES POSSESSING THE VITALI PROPERTY FOR. RADON 
MEASURES 


Throughout this section the setting of 1.1 is adopted. 


3.1 Preliminary definitions. 


DEFINITION 3.11. By external D-closed set we shall mean the R-complement 
of a G-set; A will denote the family of all such sets. 


DEFINITION 3.12. We say that A is a (Morse) disentanglement function 
if and only if A is a positive finite function defined on the spread (family of 
the constituents) of the basis B (14, p. 207). 


DEFINITION 3.13. If @ is a fixed number greater than 1, A is a disentangle- 
ment function, and V» is a B-constituent, then the Morse halo H(A, a, Vo) is 
the union of those constituents V which intersect Vo, and for which A(V) < 
aA(Vo). The halo dilatation p(A, a, Vo) is defined as the ratio 7(H(A, a, Vo))/ 
u(Vo). 


DEFINITION 3.14. We shall say that the basis B has the strong Vitali 
property (abbreviated (S.V.)) if and only if for each e > 0, each set X C E of 
finite outer measure, and each B-fine covering V of X there exists an (enumer- 
able) M-family E of V-constituents such that, for S = cE: 


(S.V.1) X — X-S € N*; 
(S.V.2) w(S — S:-X) < «; 


(S.V.3 str.) the E-constituents are pairwise disjointed. 


If, in (S.V.3 str.) we replace the strict disjunction by 0-disjunction, that is, 
disjunction mod N*, we obtain the strong Vitali property mod N*; if we discard 
(S.V.2), we have the reduced strong Vitali property (abbreviated R.S.V.). 
Recalling the Definitions 1.65, we find it convenient to designate as an 
S@)-basis, any basis having the property (S.V.) mod N*. 

The straightforward proofs of the following are omitted. 


Proposition 3.15. (S.V.) mod N* implies the Vitali property for u-finite 
u-integrals. 


PROPOSITION 3.16. (R.S.V.) and Haupt's adaptation property together imply 
the Vitali property for Radon measures. 


DEFINITION 3.17. We say that B has the generalized Morse halo property 
(14, p. 213, Def. 6.4) if and only if there exists a > 1 and a disentanglement 
function A for which 


sup{lim sup[A(M,(x)) + p(A, a, M,(x))]} < @ 


for u-almost all points x € E. Here, as in 1.1, the limit superior is taken for a 
sequence M,(x) and the supremum is taken over all x-converging sequences. 
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Remarks. The strong Vitali properties lack the flexibility of the Vitali 
properties of §1. In their formulation, one cannot replace the 0-covering 
condition by an e-covering one, nor in the (G,) case, replace the phrase “‘of 
finite outer measure”’ by “bounded.” However, such alterations are permissible 
if the constituents are A-sets. 


3.2 Generalized Morse bases. 


FUNDAMENTAL THEOREM 3.21. We suppose that B is such a basis that for each 
x € E, the sets of every x-converging sequence contain x, and each B-constituent 
is a member of A. We assume, further, that each M-set of finite measure is a 
subset of some G-set of finite outer measure, and that Morse's halo property holds. 
Then B possesses the property (R.S.V.). 


The proof of this theorem occurs in (21, p. 80). 


Remarks. In Morse’s version of the fundamental theorem, R is a metric 
space, uw a classical Radon measure, B is a blanket F. Since the contraction 
process is defined by means of the metric, the (metrically) open sets belong to 
G, the (metrically) closed to A. Thus a Radon measure in the classical sense 
is a Radon measure in the sense of 1.31, the reference sequence G°;, G°s, ... 
consisting of concentric open spheres, whose radii tend to infinity. Morse 
assumes that V € F(x) implies x € V, and that the B-constituents are closed. 
Without the closeness assumption for the constituents, B need not be strong 
as the following examples confirm. 


EXAMPLE 3.22. R is a plane Euclidean space, yu is plane Borel measure, 
and £ is the open unit square with principal vertices at (0,0) and (1, 1). 
To avoid repetition, throughout this discussion, ¢ will denote an arbitrary 
point of E, m an arbitrary positive integer. We let T, denote the set of points 
in R of the form (r/2", s/2"), where r and s are arbitrary integers. K, denotes 
the family of closed squares whose four vertices are points of T,, with sides of 
length 2. Each point ¢ lies in or on the boundary of at least one square in 
K,,; we associate, with each such ¢, exactly one square J,,, in K, such that 
t € I,.. We define J’,,, as the square concentric with J, ,,, with sides parallel 
to the axes and three times as long as those of J, ;. 

At each point z € T,, we construct a square centered at z, with sides 
parallel to the axes, and of length 2-*". We let H, denote the family of all 
such squares, and we define 


We further define I”... = In. + I'n.eJn. Finally, we so define the blanket F 
with domain E that F(t) is the family consisting of the sets 7”; ,, I’’s..,..., 
Tite ede 

For each integer m > n+ 1, there are not over 16-2°"-* points of T,, 
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lying on or in J,,;, thus not over 16-2*"-*" members of H,,, each of u-measure 
2-™, intersecting J’, ,. Therefore 


BT's. . Jn) < 16 p> ans ” < eta (Ie. 1); 


m=n+1 


since > 1, we have 
3.221 wT.) S was) & eU'ne* Jn) < 3H Uns) < Gu). 


We consider a subblanket of F, say Fi, with spread F,, having the property 
that each member of F; is included in E. We let G be any countable subfamily 
of F, whose y-overlap is zero. Each set 8 € G is a set J”, ,, and we may 
associate with 8 the corresponding set 6’ = J’, ,; we let G’ denote the family 
of the corresponding sets 8’. Due to our construction, it follows that G’ has 
u-overlap zero. Using 3.221 above and the fact that oG C E we obtain 


u(S— 0G) > w(S) — 2 u(6) >1- 42 w(6’) = 1 — fuG’) >1- 4 = 4. 


Thus, no countable subfamily of F,; whose yu-overlap is zero can cover u-almost 
all of E. At the same time, if we define A(8) = diam 6 for 6 € F, then it is 
clear that Morse’s halo property holds. 


EXAMPLE 3.23. In this example, R and E are both the set of all real numbers, 
u is linear Borel measure, V° denotes a fixed open subset of the open interval 
I = (—1, 1), containing the point x = 0, everywhere dense in J, with n(V°) = 
26°, where 0 < 6° < 1. We define V(x, ¢), for ¢ > 0, as the open set image of 
V° by the direct homothetic transformation carrying the interval (x — ¢, 
x +f) onto J. F(x) is defined as the family of all sets V(x, ¢), ¢ > 0. We 
define A(V) = w(V) for V = V(x,f); hence A(V) = 2¢6°. From this it 
follows that H(A, a, V(a, ¢)) = (x —¢(2a+1), x+¢(2a+1)); thus 
p(A, a, V) = (2a + 1)/6°. Since 

lim sup A(V) = 0 
F(z) «Vz 
for each real number x, then Morse’s halo property is valid. 

We let F; denote a subblanket of F with domain J, with the property that 
the closure of every member of the spread of F; is included in J. We consider 
any countable subfamily G of the spread of F,, whose y-overlap is zero. 
If 8 = V(x,¢) € G, then the closure #’ of 8 is the closed interval [x — ¢, 
x + ¢], and u(8) = 6° u(6’). If G’ denotes the family of the corresponding 
sets 6’, it follows from the density of 8 in §’ that the yu-overlap of G’ is zero. 
Thus, since oG C oG’ C I, we have 
w(I—oG) > u(I) — 2) u(8) = 2 — 0°. 3 u(6') = 2 — O°u(oG’) > 2 — 26°>0. 


BeG 


This shows that F has not the property (S.V.), although F is regular. 


THEOREM 3.24. If B is such a basis that each B-constituent of any x-convergent 
sequence at any point x of E includes an A-set containing x, and if Haupt's 
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adaptation property and Morse’s halo property both hold, then B possesses the 
Vitali property for Radon measures. 


Proof. We let ¥ denote an arbitrary Radon measure, X an arbitrary 
bounded subset of E, say X C G°y, V any B-fine covering of X, and « any 
positive number. Using Proposition 1.38 we have only to show the existence 
of an M-family E of V-sets such that 


A(X — X-cE) < «, w(E,y) < « 


For n = 1, 2,... we denote by B*, the set of the B-sequences M,(x) such that 
x € X, whose constituents V belong to V, are included in G°y, satisfying 
3.241 A(V) + p(A, a, V) < 2, 


by 7 a number for which 0 < » < 1, and by X, the domain of B*,. We have 
X,CX:C...CX,.... Since G°y is a G-set, V a B-fine covering of X, 
and Morse’s halo property holds, it follows that lim X, = X (mod N*), 
hence lim »(X,) = u(X). Since u(X) is finite, we may and do choose n so 
that 

3.242 u(X) — w(X,) < 


We put ¥° = sup ¥(M) for M 3 M C G°y, and, since ¥° < @, select 6 
so that 
3.243 0< (@'-1lYW<«¢.0<6<1. 


For each B*,-sequence M,(x), we determine all possible sequences A ,(x) of 
A-sets, for which the properties 


x € A,(x), u(A.(x))/u(M.(x)) > 2, ¥(A.(x)) > 6¥(M,(2)) 


all hold. The corresponding sequences A ,(x) may all be regarded as converging 
to x. The fact that for each B*,-sequence such associated sequences exist 
follows from the first assumption of the theorem and the universal lower 
approximation property of the A-sets, applied to ¥ + yu, which is implied by 
Haupt’s adaptation property (recall Proposition 1.37). Thus if A* denotes the 
family of the sequences A ,(x), then A* is a basis with domain X,,. 

Following Morse, we associate with each A*-constituent V* a B*,-constituent 
V = D(V*) (the dilatation of V*) satisfying the conditions 


V*C V,u(V*) > m(V), ¥(V"*) > (VY), 
and we define on the spread of A* the disentanglement function A* by 


3.244 A*(V*) = A(V), 
where V = D(V*). 

We observe that the halo H*(A*, a, V*), which is a union of A*-constituents, 
is included in H(A, a, V), hence 


3.245 B(H*(A*, a, V*))/u(V*) = p*(A*, a, V*) < p(4,a, V)/n. 
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Combining 3.241, 3.244, and 3.245, we deduce 
3.246 A*(V*) + p*(A*, a, V*) < (A(V) + p(A, a, V))/n < n/n, 


where V = D(V*). Thus A* possesses the Morse halo property, in fact, 
uniformly. The assumptions of Theorem 3.21 hold, hence A* has the property 
R.S.V. Thus there exists a disjointed M-family E* of A*-constituents, such 
that for S* = cE*, S* > X, (mod N*). 

We define E as the M-family obtained from E* by the correspondence 
V = D(V*). The E-constituents belong to V and lie in G°y. Since D( V*)D V*, 
then 

S = cE D S* D X, (mod N*), SD X, (mod N). 


Using 3.242, we have 
u(X — X-S) < u(X) — (X,) < 
Also, since S C G°y, we have ¥(S) < ¥° < @; hence, from 3.243, 
w(E, ¥) = 2 (VY) — ¥(5) 
<o"( 2 HV") — WS") = © = 1) WS") <e 


v*«E* 
The M-family E fulfils the required conditions, which means that the basis 
B possesses the Vitali property for Radon measures. The following is the 
immediate consequence of Theorem 1.64. 


CorOLLARY 3.25. Under the assumptions of Theorem 3.24, B differentiates 
every Radon measure. 


Remarks. The essential steps in the foregoing proof are (i) the contraction 
of the B*,-sets into A-sets of nearly equal y-measure, with s-exhaustion 
power greater than 7, (ii) the transfer, expressed by A*(V*) = A(V), of the 
function A from the original sets to the new ones. The second step shows 
the power of Morse’s methods residing here in the choice of the new disen- 
tanglement function. 

In Morse’s paper (14) it is remarked that the metric axiom 


5(p’, p’’) = O implies p’ = p” 


is never used. Discarding this axiom, the (metric) closure P of the set {p}, 
consisting of the point p only, is the set P of points x with 6(p, x) = 0. To the 
various points x of P may be attached different families F(x), but any Borel 
set containing p must contain P. This is why the first assumption in Theorem 
3.24 is satisfied under Morse’s relaxed hypotheses. 

Morse’s halo property in the general case involves the contracting process; 
this is not so, however, in the special case of uniformity. If; for a bianket F, 
there exists a > 1 and A such that A(V) + p(A,a, V) is bounded on the 
spread of F, then the same is true of any blanket with the same spread, in 
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particular for the blanket F* so defined that F*(x) is the family of F-con- 
stituents containing x, which is a D-basis. Not only does F differentiate any 
Radon measure but F* does, too. 

Referring to Theorem 2.29 and Theorem 3.24, the Examples 3.22 and 3.23 
give a negative answer to a question raised by O. Nikodym in 1947: “Is the 
property (SV) mod N* equivalent to the validity of the differentiation theorem 
for u-finite u-integrals?”’ 


FUNDAMENTAL THEOREM 3.26. We suppose that B is a basis whose 
constituents are A-sets, and that there exists a disentanglement function A such 
that 


sup{lim sup p(4, 1, V)} < @ 


almost everywhere on E. We assume that B has the property (L), namely, corres- 
ponding to any subset X of E of finite outer measure, any B-fine covering V of X, 
and any « > 0, there exists an M-family E of V-constituents such that for S = cE 
we have 

(V1) S > X (mod N*), 

(V2) u(S — S-X) <«. 


Then B has the property (S.V.). 


Proof. This can be found in (21, p. 83). See also the Remarks after Theorem 
3.27. 


Remarks. Property (L) asserts the existence of an M-family covering 
X (mod N*) without any overlap condition. The essential step in the proof 
of the theorem is called disentanglement, and rests upon the halo property. 
For this reason (L) may be regarded as a rough or pre-Vitali property. In 
the formulation of (LZ), (V1) may be replaced by u(X — S-X) < «, and in 
case (UG) holds, (V2) may be dropped. 

Property (LZ) is named after Lindeléf. In fact, if R is a metric space with a 
countable basis (separable, in Fréchet’s terminology), 4 a Radon measure, F 
a blanket such that the constituents of F(x) are open sets containing x, then 
Lindeléf’s classical topological property implies (even expresses) the property 
(L). 

In Theorem 3.26, it is not assumed that the constituents of an x-converging 
sequence contain x, as is the case in Theorem 3.24. 


THEOREM 3.27. If there exists a disentanglement function A such that 
sup{lim sup p(A, 1, V)} < @ 


almost everywhere on E, and if Haupt’s adaptation property and (L) holds, then 
B possesses the Vitali property for Radon measures. 


Proof. We let ¥ denote a Radon measure, X a bounded subset of E (we 
assume X C G°y), V a B-fine covering of X, « a positive number. Due to 
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Proposition 1.38, we need to prove only that there exists an M-family E of 
V-sets such that a(X — X-cE) < «and w(E, y) < «. 


For n = 1,2,... we denote by B*, the set of B-sequences M,(x) with 
x € X, whose constituents belong to V, are included in G°y, and satisfy 
3.271 p(A, 1, V) <n; 
we denote the domain of B*, by X,. Since the sequence X,, X2,...,Xn,--- 
is increasing with lim X, = X (mod N*), we may and do choose » so that 
3.272 u(X) — u(X,) < ¢; 
we let Z = X,. 


We denote by V' the spread of B*,, by 7 a fixed number, 0 < 9 < 1, byéa 
positive number such that 


3.273 0< (@'- 1) <e«, 
where ¥° = sup ¥(M) for M 3 M CG,,. Finally we fix an auxiliary sequence 
of positive numbers ¢, whose sum is less than e. 


Property (ZL) allows us to select an M-family M,, M2,...M;,... of V-sets 
such that 


3.274 T='UM,>Z(modN*), w(T-T-Z)<e 


j=l 


where e' = min { €, 7 @(Z)/2(m + »)}. 
Since lim »(T,) = u(T) < ©, where 


@ 
Ze = U M;, 
j=1 


we can choose Q so that 


3.275 u(T — Tg) < e. 
Now, using 3.271, we can extract from the finite family M,, M2,..., Mga 
disjoint subfamily M',, M's, ..., M',,, such that for 


@ 
T= UM", 
k=1 
we have 
3.276 u(T") > u(T¢)/». 


This is the disentanglement step. 
With each set M', we associate an A-set A’, C M', in such a way that 


3.277 u(A'y) > nu(M'), (A) > 5p(M',). 


We evaluate the Z-exhaustion of 


@ 
Si U A, 


k=1 
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by which we mean the value u(S'-Z) = g(S'-Z). Since S'C T, Z C T (mod 
N*), then, using 3.274, 3.275, 3.276, and 3.277, we derive 
u(S+Z) > u(Z) + u(S") — u(T) 
> a(Z) + n(a(Z) — «')/v — (a(Z) + €') 
= 9f(Z)/v — e(n + v)/v > na(Z)/2r. 


So far we have been able to select a finite disjoint subfamily M',, ... M',, of 
V'-sets (hence V-sets), with a Z-overflow less than ¢«,, and to contract each 


M', into an A-set A', such that the new family has a Z-power of exhaustion 


greater than »/2y; that is, the ratio of Z-exhaustion is greater than ¢ = »/2v. 
We repeat the process with Y = Z — Z-S' and the family V’ consisting 


of those V'-constituents which do not intersect any A, k = 1,2,...,q@1, 
to produce two finite disjoint subfamilies M*,, M*.,..., M*,, and A*,, A*s, 
., A*,, satisfying, for / = 1,2,...,q2, & = 1,2,...,q:, the relations 
2, € V*, A*,C M?,, A®, € A, A'y-M?, = 0, u(S*-Y) > ¢a(Y) 
and - 
p(S? — S?-Y) < és, 

where 

ds 

S= UA’, 
l=1 


The iteration of this exhaustion process yields two M-families, namely, E 
consisting of the sets M',,..., M%,,, M*,,...,M*%,, M*1,..., Maser ss 
and C consisting of the sets A\, Ty At, ere oe? . Pee oe 
such that (i) the E-constituents belong to V, (ii) S = cE > Z (mod N*), 
(iii) the overflow a(S — S-Z) < «, (iv) to each C-set A there corresponds an 
E-constituent V = D(A) (the dilatation of A) satisfying A C V, ¥(A) > 
éy(V). In view of these facts it is easily seen that the remainder of the proof 
involves merely a repetition of a portion of the proof of Theorem 3.24, hence 
E has met the two necessary requirements, and the theorem is proved. 


Remarks. The proof of Theorem 3.26 is obtained from the preceding proof 
by discarding the contraction process. 

Haupt’s adaptation property is used only in the contraction process. 
If the weaker condition (UG) is substituted, then the assertion of the theorem 
remains true for Radon integrals. 

In the proof of Theorem 3.24, Morse’s function A is used to disentangle 
the infinite family of constituents to produce the desired countable family, 
hence the necessity of a choice condition based on the boundedness of A. 
In the foregoing proof we disentangle a finite family and repeat the process, 
producing the desired family by juxtaposition of sections. 

In the proof of Theorem 3.27, we do not treat of Morse’s halo H(A, a, Vo) 
itself, but only with a finite family of halo constituents, which may be aptly 
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called a partial halo. For this reason we define H’(A, a, Vo) as the union mod 
N (Definition 2.51) of the constituents V intersecting Vo, with A(V) < aA(Vo). 
In the formulation of Theorems 3.26 and 3.27 we can replace the halo dilata- 
tion p(A, a, Vo) by 


p' (A, a, Vo) = u( A’ (A, a, Vo))/p(Vo) 


clearly p’ < p. 

In defining H or H’ we accept all constituents intersecting V» and satisfying 
A(V) < aA(V.). The incidence requirement is V-V»o # 0. Correspondingly, 
disentanglement requires the determination of a strictly disjointed family of 
constituents. The incidence requirement may be altered to essential intersection, 
that is, u(V-V>») > 0, and in turn the disentanglement changed to require the 
production of a family of pairwise mod N disjointed constituents. This point 
of view can be adopted in Theorem 3.26 if we wish to achieve the strong 
Vitali property mod N, and in Theorem 3.27, if we restrict the assertion to 
Radon integrals. The stronger we make the incidence requirement, the weaker 
the halo condition becomes. Busemann and Feller gave a necessary and suffi- 
cient halo condition for the validity of the density theorem, equivalent, by 
virtue of Theorem 2.12, to the Vitali u-property, for Euclidean derivation 
bases of the D-type, the constituents being open sets, and the contraction 
being defined metrically. We have already encountered a halo of Busemann- 
Feller type in 2.4, namely o,,,(E). 


3.3. Half-regular and regular branches of a derivation basis. 


DEFINITIONS 3.31. To some of the sequences M,(x) of the basis B we 
correlate Moore-Smith sequences M*,(x) of M-sets of positive finite measure, 
with the same indices and the same convergence point, such that for each 
sequence 
(R1) lim inf u(M*,)/u(M,) > 0 
(R2) lim sup »p(M*, — M,-M*,)/u(M,) = 0. 


The set B* of the sequences M*,(x) is called a half-regular branch of B, and 
B* UB is called a half-regular extension of B. If (R2) is replaced by the 
strengthened requirement M*, C M,, then B* is called a regular branch, and 
B* U B is called a regular extension of B. (6, 9.7). 

De Possel has shown that the Vitali u-property is preserved by half-regular 
extension. An example exists in which the sequences M,(x) are ordinary 
sequences of concentric closed squares in the plane (10, pp. 292-295). The 
corresponding set M*,(x) consists of M,(x) augmented by small satellite 
squares, in such a manner that lim »(M*,)/u(M,) = 1, where u denotes plane 
Lebesgue measure. 

Example 3.23 deals with a regular branch of the basis of closed concentric 
intervals on the line. It shows that the strong Vitali property is not preserved 
under regular extension. 
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We shall now investigate the behavior of the Vitali properties under regular 
extension. 


LEMMA 3.32. We let y denote a Radon measure. We assume that there exists 
¢ > 0 such that corresponding to any bounded set X C E of positive outer measure, 
any B-fine covering V of X, any u-cover M of X, and any ¢ > 0, there exists an 
M-family E’ of V-constituents with union S’ = cE’ for which 


A(X-S’) > ga(X), V(S'—-S’M)<, w(E’, yp) < €. 
Then B has the Vitali p-property. 


Proof. We assume that « > 0 and attempt to find an M-family satisfying 
(V1), (V2), (V3) of Definitions 1.33. We introduce a sequence ¢;, €,... of 
positive numbers whose sum is less than ¢«/2. Since X is bounded and V is a 
B-fine covering of X, we may and do assume that both X and all the V-sets 
lie in some G°y. 

By hypothesis, there exists an M-family E, of V-sets for which 


u(X1-S)) > ta(X)), v(S; = S;M,) < €1, w(E, y) < €1, 


where S,; = cE,, X; = X, M; = M. We repeat this process on the sets 
Xo = X1 — XSi, M2 = M, — M;1S;; by iteration in this way, we obtain a 
sequence of M-families E,, a nested sequence of sets X, C X, with a nested 
sequence of u-covers M,, fori = 1, 2,..., such that, for each such i, putting 
S; _ cE, 
u(X¢S) > ca(X,), WS; -—-S¢M) <€, w(Ey wv) < & 

Finally, we let E denote the union of the families E,. 

Since the rate of exhaustion at each step exceeds ¢, then E exhausts X; 
that is S = cE > X (mod N*). We complete the proof by evaluating the 
y-overflow and y-overlap. We have 


¥(S— S-M) < > W(S,-— S,;-M) <6 
5 | 


w(E, y) = p> ¥(V) — ¥(S) 
<= (= W(V)) — W(S-M) = L (> v(V) — WS, M,)) 
-2 (> (¥(V) — v(S,))) + L WS, — My S,) 


> w(E,, ¥) + p> ¥(S, — M;- S;) <«. 
i 


Remark. If Haupt’s adaptation property holds, then the overflow require- 
ment in the above may be omitted. The same is true if the weaker (UG) 
holds and y is a Radon yg -integral. 


LemMA 3.33. If y is a Radon measure, r = ¥ + u, B possesses the Vitali 


t-property, and B* is a regular branch of B, then B* possesses the Vitali p-pro- 
perty. 
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Proof. We let X denote a subset of E of positive outer measure included in 
some G°y, M a u-cover of X, V* a B*-fine covering of X, and « a positive 
number. 

For n = 1, 2,... we define B*, as the set of all B*-sequences S* consisting 
of the sets V*,(x) for which (i) x € X, (ii) V*,.(x) € V* for all ¢, and (iii) 
there corresponds to each S* a B-sequence D(S*) = S, consisting of those 
sets V,(x) for which V*, C V, and w(V*,)/u(V,) > 1/m for all «. 

We denote by B, the set of the B-sequences associated by D to the B*,- 
sequences, B, = D(B*,), and by X, the domain of B,, which is also the do- 
main of B*,. Since V* is a B*-fine covering of X and B* is a regular branch of 
B, then X, increases with nm and lim u(X,) = u(X). As in the proof of Theorem 
3.24 we may and do choose k& so that 


3.331 u(X) — w(X,) < «. 


We shall show by Lemma 3.32 that the subbasis B*, of B* possesses the 
Vitali ¥-property. We consider a subset Y of X, of positive outer measure, 
P a p-cover of Y, T* a subbasis of B*, with domain Y (mod N*), ¢ a positive 
number. We define ¢ = 1/2k, and ¢’ = min (¢’, a@(Y)/4k). T = D(7*) isa 
subbasis of B with the same domain as 7*. Since B has the Vitali r-property, 
there exists an M-family E of 7-sets such that 


3.332 S=cED Y(modN*), r(S—S-P)<e’, w(E,7) < &”. 


With each E-set V we associate a V*-set V’ = C(V) (the contraction of V), 
with 


3.333 C(V) C V, w(V’)/u(V) > 1/k. 
We define the M-family E’, demanded by Lemma 3.32, as C(E). Clearly 
3.334 w(E’, ¥) < w(E, ¥) < w(E, 7) < &” < e; 


the y-overlap condition is satisfied. 
Putting S’ = cE’, 


3.335 V(S’ — SP) <W(S— SP) <r(S-—SP)<e&’ < €; 


thus the ¥-overflow condition is satisfied. 
We turn to the evaluation of the Y-exhaustion of ¢’, namely, 


u(S’-P) = u(S’) — w(S’ — SP). 


Since p(S’ — S’-P) < u(S — S-P) < r(S — S-P) < e’, we have u(S’-P) > 
u(S’) — ¢’. From the last relation in 3.332, w(E’, nu) < w(E, un) < ¢”’, hence 


u(S’- P) > 2H’) — 2X" > (1/k) De u(V) — 2” 
> (1/k) a(Y) — 2c” > a(Y)/2k = gn(Y). 


The basis satisfies the requirements of Lemma 3.32. Consequently, there 
exists an M-family E* of B*,-constituents, hence V*-constituents for which 
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S* = cE* D X,(mod N*), ¥(S* — S*-M) < «, w(E*, ¥) < «. 


Due to the choice of k, from 3.331 we have obtained the Vitali ¥-property in 
the ¢-version. 


The following is an immediate consequence of Lemma 3.33. 


THEorEM 3.34. If B* is a regular branch of B, and B possesses the Vitali 
property for Radon measures (resp., integrals), then B* possesses the Vitali 
property for Radon measures (resp., integrals). 


Examples 3.35. B is the cube basis in Euclidean space E;, » the Borel 
measure, M the family of Borel sets, B* consists of all sequences of Borel 
sets of positive measure converging regularly to a point. According to Lemma 
3.33, B* possesses the Vitali property for Radon measures, hence it differ- 
entiates them. 

The subbasis of B* consisting of all sequences of closed sets of positive 
measure, converging regularly to a point, is the classical Lebesgue basis which 
enjoys the strong Vitali property. We may notice that any sequence of M-sets 
of positive measure converging homothetically to a point belongs to B*. 

If we denote by B** the superbasis of B* consisting of all sequences of 
Borel sets of positive measure converging to a point x (without regularity 
condition), it is easy to see that for a Radon integral ¥(M) = Suf(M) du 
we have D*y(x) = essential maximum of f at x, Def(x) = essential minimum 
of f at x. It follows that the only Radon integrals differentiated by B** are 
the integrals of functions summable at finite range and essentially continuous 
almost everywhere. This shows clearly the loss of differentiation power when 
discarding the regularity requirement for converging sequences. Finally, 
we notice that B** is a blanket, different from both B* and the Lebesgue 
basis. 


3.4 Star blankets. To conclude, we give another example where the 
surrendering of the closeness assumption for the constituents of a basis 
means the replacement of the strong Vitali property by a Vitali property 
for Radon measures, with no loss of differentiation power. 


Derinitions 3.41. R denotes Euclidean n-dimensional space. The hub 
of a set X C R is the set of these points x € X such that the segment joining 
x and x’ lies in X whenever x’ € X; the hub radius of X at a point x € X is 
the supremum of those numbers p for which the solid sphere with x as center 
and p as radius is included in the hub of X. A star blanket (15, p. 432) according 
to Morse is a blanket F in R, whose constituents are closed sets, and such that 
for each x of its domain, 

lim sup (diam V)/(hub radius of V atx) < ©. 


F(z)? V2 


We define Borelian star blankets by discarding in Morse’s definition the 
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closeness requirement, and replacing it by the weaker demand that the con- 
stituents be Borel sets. 


Remarks. Morse proved that a star blanket in his sense possesses the strong 
Vitali property whenever » is a Radon measure. From his differentiation 
theorems he deduces the existence y—almost everywhere of a finite derivative 
for every Radon measure y. 


THEOREM 3.42. Borelian star blankets possess the Vitali property for Radon 
measures, whenever the basic measure yp is itself a Radon measure. 


Sketch of the proof. Referring to (15), all properties of star blankets exhibited 
in §5 up to the application of Morse’s Theorem 3.4 in 5.11 are valid without 
using the fact of closeness of the constituents. The statement of this basic 
theorem, but for minor changes, is as follows: R is a metric space, u a Radon 
measure, 0 < ¢ < ~, X C R, and F is a family of closed sets. Corresponding 
to each bounded open set G there exists a countable disjointed subfamily 
K of F for which cK C G, a(X-G) < tu(cK). Then every disjointed subfamily 
of F can be extended to a countable subfamily of F covering X (mod N*). 

If the F-sets are required merely to be Borel sets, instead of closed, then 
we have to change the conclusion as follows: For any Radon measure y 
and any « > 0, every finite subfamily of F whose y-overlap is less than « 
can be extended to a countable subfamily of F covering X (mod N*), and with 
¥-overlap less than e. 


THEOREM 3.43. Borelian star blankets differentiate every Radon measure, 
the basic measure pu being itself any Radon measure. 


Proof. This follows from Theorem 3.42. 


Remark. This differentiation theorem, like Morse’s theorems, implies that 
the set of points provided with yu-nullsequences is a y-nullset. 


§4. An APPROACH TO A THEORY OF DIFFERENTIATION OF ABSTRACT INTERVAL 
FUNCTIONS 


We take our setting as in 1.1. (G,) and the reduced strong Vitali property 
are assumed to hold. A denotes a finite numerical function defined on the 
spread D. 


4.1 Preliminary definitions. 


DeFinitions 4.11. Any enumerable disjointed family P of B-constituents 
is called a Vitali partition. A partition P is called V-fine if V is a family of 
B-constituents and the sets in P belong to V.P is said to be bounded if the 
P-constituents are included in some G°y. If the sum 


> A(V) 


veP 
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represents a real number (including + © and — ©), we denote it by (A, P) 
and we say that P is \-integrable. 

This last condition is always fulfilled when \ > 0. In general, however, A 
may be of variable sign, and we shall henceforth assume that P is \-integrable 
whenever P is a bounded Vitali partition. 


DEFINITIONS 4.12. A Vitali partition over X C R is defined as a Vitali 
partition covering X (mod N*). 

A Vitali partition over X is thus a Vitali partition over any set Y having a 
u-cover in common with X. From the reduced strong Vitali property it follows 
that for any bounded set X C E and any B-fine covering V of X, there 
exists a V-fine Vitali partition over X. 


DEFINITION 4.13. A set X is called \-admissible if any full B-fine covering 
of X includes a \-integrable Vitali partition over X. According to our assump- 
tion above, the bounded subsets of E are \-admissible. 


DEFINITION 4.14. For any \A-admissible set X C E we define the upper 
Vitali integral ¥°(X) or ¥°(A, X) and the lower Vitali integral po(X) or po(A, X) 
as lim sup ¥(A, P) and lim inf ¥(A, P), respectively, the limits being taken in the 
family of the \-integrable partitions P over X with the full B-fine coverings 
of X serving as a scale of fineness. We have 


y°(X) = W°(X-E), Wo(X) = wo(X-E), W(N*) = 0, N*CEN*. 


\ is said to be Vitali integrable over X if ¥°(X) = Wo(X), Vitali summable 
if ¥°(X) and Yo(X) are equal and finite. In either case the common value is 
denoted by ¥(X). 

Explicitly, \ is Vitali summable to ¥(X) if ¥(X) is finite, and corresponding 
to any e« > 0, there exists a full B-fine covering W(e) of X such that for 
any \-integrable Vitali partition P over X whose constituents belong to 
We), we have |y(X) — (A, P)| < «. 

The Vitali integrals are discussed in (18), wherein E = R, the strong Vitali 
property holds, and the B-constituents are assumed to be G-sets. Thus 
the Vitali partitions over a set X form a directed system which is used as the 
scale of fineness. The definition for the Vitali integrals is subsumed by the 
more general one adopted here. 


4.2 Differentiation theorems for Vitali summable functions. 


Lemma 4.21. Jf B has the strong Vitali property, X is a bounded subset 
of E, and a and 8 are any finite numbers, then: 

(a) ¥°(X) > af(X) whenever D* > a almost everywhere on X; 

(b) wo(X) < Ba(X) whenever Ded < 8B almost everywhere on X. 


Proof. We first establish (a). If ¥°(X) = @, there is nothing to prove. 
Accordingly, we assume that ¥°(X) < «, « > 0, and D*\ > a almost every- 


® 











270 C. A. HAYES, JR. AND C. Y. PAUC 


where on X. In accordance with the definition of ¥°(X), there exists a full 

B-fine covering W’ of X such that for any bounded W’-fine Vitali partition 

P over X, we have ¥(A, P) < ¥°(X) + «. The family V’ of the W’ constituents 

satisfying the relation A(V) > au(V) is a B-fine covering of X, hence, on 

account of the strong Vitali property and the boundedness of X, there exists 

a disjointed enumerable bounded subfamily P of V covering X (mod N*). 
We have ; 


vA, P) > a> »(V) > af(X). 


Since P is W’-fine, ¥°(X) > ¥(A, P) — «. Combination of the two relations 
yields (a), since ¢ is arbitrary. 

We turn to (b). If yo(X) = — @, there is nothing to prove; we thus assume 
that ¥o(X) > — ~, e>0, and Ds < 8 almost everywhere on X. There 
exists a full B-fine covering W’” of X such that for any bounded W’-fine 
Vitali partition P, we have 


(A, P) > Wo(X) — «. 


The family V” of the W’-constituents satisfying \(V) < Bu(V) is a B-fine 
covering of X. Due to the strong Vitali property and the boundedness of X, 
V” includes a disjointed enumerable bounded subfamily P with 


S = oP D X(mod N*), u(S — S-X) <«. 
We have 
¥(A, P) < BD u(V) = Bu(S) < B(u(X) + 6), 


and since P is W” fine, (A, P) > ¥o(X) — «. Combining, we obtain yo(X) < 
Ba(X) + e(1 + 8). From the arbitrary nature of e, (b) follows. 


THEOREM 4.22. Assuming the strong Vitali property, if the function d is 
Vitali summable over every bounded subset X of E, then the B-derivative Dy 
exists and 1s finite almost everywhere on E and D) is uz*-measurable. In particular, 
if on the bounded subsets X of E, ¥(X) can be represented as 


fife dur 


Proof. It follows readily from the preceding lemma that N’ = [D*\ = @] 
and N” = [DsX = — @] are N*-sets. 

We regard D*\ and Dsi, restricted to the set E* = E— (N’UN”"), 
as the functions f and g occurring in Lemma 1.23. Recalling the Remarks 
following Definitions 1.31, it clearly suffices to derive a contradiction from 
the assumed existence of two bounded yz*-entangled sets A and B and two 
finite numbers a and 8 with B < a, such that D*\ > a on A-and Ded < 8 
on B. However, since ¥°(A) = ¥°(B) = ¥o(B) = Wo(A), Lemma 4.21 yields 
the desired contradiction at once. 


then Dd = f (mod N*). 


~~ 
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To prove the second part of the theorem, we let A and B denote two bounded 
subsets of E of positive measure, A = B (mod N*), such that the convex 
closure of f(A) and the convex closure of Dd (B) are at a positive distance 
apart. This means that there exists two finite numbers a and 8, with a > 8, 
and either 


4.221 f(x) < Bon A, DXdX(x) >aonB 
or 
4.222 f(x) > aon A, DX(x) < Bon B. 


Since the integrand f is E-M-measurable, then f(x) < § almost everywhere on 
A-E in case f(x) < 8 on A, whence 


f Je dux < Bus(A. E) = Bp(A); 


while, according to the preceding lemma, ¥(B) > af(B) if DA(x) > a on B. 
Since f(A) = a(B) > 0, and ¥(A) = ¥(B), the inequalities in 4.221 are 
incompatible. Similarly, it follows that 4.222 cannot hold. Referring to Lemma 
1.23 and the Remarks following Corollary 1.24, we obtain DA = f (mod N*). 


4.3 An example of Vitali summable functions: The non-negative upper 
semi-additive functions. 


DEFINITION 4.31. The non-negative function \ is called upper semi-additive 
on E (with respect to B) if, corresponding to any B-constituent V and any 
e > 0, there exists a full B-fine covering W, of V-E such that for any W.-fine 
Vitali partition P, y(A, P) < A(V) + «. 


THEOREM 4.32. If \ is a non-negative upper semi-additive function, then d is 
Vitali integrable over the subset X of E and 


¥(X) = inf (V) 


for all M-families E of B-constituents* covering X (mod N*). In particular, 
if for any bounded Vitali partition P, ¥(r, P) is finite, then ¥(X) is finite on 
bounded X. 


Proof. We regard X as fixed and let 
vo = inf >> X(V), 
VeE 


let e denote an arbitrary positive number, and ¢;, €2,..., €,, . . . a sequence of 
positive numbers whose sum is less than $e. 

If yo = ©, then ¥(X) = @ and there is nothing to prove. We assume, then, 
that yo < . From the definition of yo, we are able to find an M-family 
Vi, Vo,..., Va»... of B-constituents covering X (mod N*), such that 


> A(Va) < Yo + Fe. 


5While the Vitali integral ¥(X) may be defined for vector valued functions, the right hand 


side of the equation just given presupposes a complete lattice structure. 
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For each V, we determine a full B-fine covering W, in such a way that for 
any W,,-fine Vitali partition P, over V,-E, 


vA, P,) < A(V,) + €n, 


and we define W* as the union of the W,. W* is a full B-fine covering of X. 
The theorem will be established if we prove that for any W*-fine Vitali 
partition P over X 
¥(A, P) <vyteé 


since ¥(A, P) > vo. Accordingly, we let P be such a partition, and subdivide 
P into disjointed sub-partitions P,, P, being W,-fine. This decomposition is 
certainly possible but not necessarily unique. Due to the upper semi-additivity 
of A, 

V(A, P,) < ACVa) + ens 
hence by addition 


D vA, P,) = ¥(A, P) < DAV) + Fe < wo te 


As for the second part of the theorem, if X is bounded, then X is covered 
(mod N*) by a bounded Vitali partition P for which (A, P) is finite, therefore 
Yo is also finite. 

Remark. Under the hypotheses of the theorem the function y defined on 
all subsets of EZ is a Carathéodory outer measure, meaning that it satisfies 
(C1) and (C2) of (25, p. 43). Besides, ¥ vanishes on the N*-subsets of E. 
With the Carathéodory restriction process of an outer measure to a measure 
in mind, we may expect the restriction of ¥ to the E-M-sets to be a Radon 
#eg-integral, which would enable us to apply Theorem 4.22. This proves true 
under the assumptions in (18); more on this occurs in 4.4 below. 


4.4 Morse’s addivelous functions. Morse’s definition of addivelous func- 
tions given when B is a Borelian blanket can be readily transposed to a general 
basis. 


DEFINITION 4.41. We say that the function \ defined on the family D* of 
subsets of R is addivelous if: 

(i) A is non-negative; 

(ii) D* includes the spread D of B; 

(iii) Whenever the B-constituents Vi, V2,... are disjoint and included 
in the D*-set D*, then ZA(V,) < A(D*); 

(iv) if V € D and e > 0, then there exists a D*-set D = D(V) for which 
A\(D) < A(V) + «, I(D) D V-E (mod N*).® 

Defining W,. as the family of the B-constituents included in D, we see 
that an addivelous function is upper semi-additive. The strong Vitali property 
secures the existence of a Vitali partition V°:, V°2,..., over E. We define 
(recall the final Remark after Definitions 1.31) 


*For the definition of J(D) refer to 1.1. 
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R®, = D(V;i) UD(V2) U...U D(Y,), 


and regard a Vitali partition P as bounded if, for some N, the P-sets are 
included in J (R°y). Thus for any bounded Vitali partition P, ¥(A, P) is 
finite. 

Theorem 4.32 is applicable, and expresses the equivalence between Vitali 
integration and Morse’s regularization, for the subsets of E. 

In Morse’s case, B is a Borelian blanket. Carathéodory's condition (C3) 
is clearly fulfilled. The restriction of y to the E-M-sets is a u,-integral, more 
precisely a Radon y,g-integral with respect to the expanding reference sequence 
R°,. Theorem 4.22 can be applied; DA is equal to a Radon-Nikodym u,- 
integrand of y|E-M. In an unpublished lecture delivered before the American 
Mathematical Society in 1948, Morse gave an interpretation, when E = R, 
of the indefinite integral of DA as y|M, where ¥(X) is defined as the infimum 
of numbers of the form 


p>: (V), 


where E is such a countable subfamily of D that E covers almost all of X. 
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POINT-FINITE AND LOCALLY FINITE COVERINGS 


ERNEST MICHAEL 


1. Introduction. An interesting feature of recent topological develop- 
ments is the increasingly important role played by locally finite coverings.' 
Point-finite coverings, on the other hand, even though conceptually simpler, 
have received very little attention. And deservedly so, since they are much 
less useful. Nevertheless, it sometimes happens (as it did to the author in 
(5)) that one is confronted by a covering which is known to be point-finite, 
but not necessarily locally finite. When does such a covering have a locally 
finite refinement? The purpose of this paper is to provide some answers to 
this question in the following two theorems (which the author happens to 
need in (5)), and to construct some counter-examples to certain related 
conjectures. It should be pointed out that, while Theorem 2 seems to be new, 
Theorem 1 is known (6, Theorem 3 and Lemma 3), and is stated here only for 
completeness, and because it is needed in the proof of Theorem 2. 


THEOREM 1 (Morita). Every countable, point-finite covering of a normal 
space has a locally finite refinement. 


THEOREM 2. Every point-finite covering of a collectionwise normal space has a 
locally finite refinement. 


Whether Theorem 1 remains true with “‘point-finite” omitted is one of the 
major unsolved problems in point-set topology, and it is equivalent to the 
problem of whether the cartesian product of a normal space and the closed 
unit interval is normal (2; 4). It is of course not possible to omit “‘point-finite” 
in Theorem 2, since a collectionwise normal space need not be paracompact. 
And finally, the following two counter-examples show that two other plausible 
directions for improving Theorem 2 are also barred: 


Example 1. There exists a normal space, every point-finite covering of which 
has a locally finite refinement, but which is not collectionwise normal. 


Example 2. There exists a normal space, not every point-finite covering of 
which has a locally finite refinement. 


In §3, where these examples are constructed, it will be shown that they can 
even be slightly strengthened, and that, in particular, the spaces can be chosen 
to be perfectly normal. 

We conclude this introduction with a quick review of our principal concepts. 
Let X be a Hausdorff space. In this paper, a covering of X is a collection of 
open subsets of X whose union is X. A collection W of subsets of X is point- 


Received May 7, 1954. 
1. All terms are defined at the end of this introduction. 
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finite if every x € X is an element of only finitely many A € J; it is locally 
finite if every x € X has a neighborhood which intersects only finitely many 
A € &. If Vand Ware coverings of X, then Wis a refinement of V if every 
W € Wis a subset of some V € ¥. Normal spaces are, of course, familiar. 
According to Bing (1), X is collectionwise normal if, whenever {A,} is a collec- 
tion of subsets of X which is discrete (i.e., locally finite, and with pairwise 
disjoint closures), there exists a disjoint collection {U,} of open subsets of X 
such that A, C U, for every a. Finally, X is paracompact if every covering of X 
has a locally finite refinement. The relations between these three types of 
spaces, as shown by Bing (1), are that 


paracompact — collectionwise normal — normal, 


and that neither arrow can be reversed. 


2. Proof of Theorem 2. Let VY be a point-finite covering of the collectionwise 
normal space X. We are going to construct a sequence {W,} (i = 0, 1,...) 
of collections of open subsets of X such that, denoting U{wi|w ¢« W,} by 
W,, the following conditions are satisfied for all 7: 


(a) Every W € W, is a subset of some U € &. 
(b) W,, is locally finite (in fact, discrete). 


(c) If x € X is an element of at most i elements of W, then 


1 
xe€E U W,. 


k=0 
(d) Every x € W, is an element of at least i elements of V. 


Suppose, for a moment, that {W,} has been constructed, and notice how 
the theorem follows. In fact, remembering that YW is point-finite, we see that 
{W,} is a covering of X (by (c)) which is point-finite (by (d)). It then follows 
from Theorem 1 that {W,} has a locally finite refinement { V,}, with V; C W, 
for every i, and therefore U7_0{ Vv; W|W € W,} isa locally finite refinement 
of % (by (a) and (b)). 

It remains to construct the sequence {W,}. Let Wy = {¢} (i.e., the only 
element of % is the null set); then conditions (a)-(d) are clearly satisfied for 
i = 0. Suppose, therefore, that W, ..., YW, have been constructed to satisfy 
(a)-(d) for all i < m, and let’us construct W,.. 

Let ® be the family of all AC Wsuch that # has exactly n + 1 elements. 
For every # € §&, let 


A(@) =(x- im,)a(x- Uiuew|u ¢2@}) 


Clearly every A(#) isclosed. Let us show that {A(#)|(#) € MR} is discrete, 
by showing that every x € X has a neighborhood which intersects at most 
one A(#). We consider three cases: if x is in >n + 1 elements of Y, then the 
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intersection of any n + 2 of these does not intersect any A(A&); if x is in 
< n+ 1 elements of Y%, then (by (c)) 


x € U W,, 
k=0 
which does not intersect any A(#); and if, finally, x is in exactly » + 1 
elements of %, say in U;,..., Unss, then 
n+1 
fv, 
k=1 
is a neighborhood of x which does not intersect A (.”) for “# {U;,..., Uns} 


(since then at least one U, is not an element of and this U, cannot intersect 
A(S)). 

Since {A(#)|# € R} is thus a discrete collection of closed subsets of the 
collectionwise normal space X, there exists a disjoint collection {V(#)|F% € R} 
of open subsets of X such that A(#) C V(#) for every # € ®; by aresult of 
Dowker (3, p. 308), we can even pick {V(#)|F4 € R} to be discrete. Now notice 
that A(#) C U for every U € &, since otherwise some x € A (#) would be 
an element of <n elements of Y, which is impossible by (c) and the definition 
of A(#). Hence, if we let 


P(A) = VAAN {UU € B, 


then A(#) C P(A) forevery # € R. We now define Wrssr={P(2A)|AE R}. 
Let us check that conditions (a)-(d) are satisfied fori = + 1. That (a), (b), 
and (d) are satisfied follows directly from the definition of W,,,. To check (c), 
let x € X be an element of <n + 1 elements of Y; then clearly there exists 
an # € R such that x € (X — U{U € W@|U ¢A}). But then either 


x (x -Ujve vu ‘a}) ) (x -U w,) = A(R) C P(R) C Warr, 


n 
or xée U Wi; 
k=0 
thus in either case 


This completes the proof. 


3. The Counter-examples In this section we shall describe the spaces of 
Examples 1 and 2 in the introduction, and show that they have the required 
properties: 


Example 1. As a space with the required properties, we submit the normal, 
but not collectionwise normal, space F of Bing (1, Example G). We refer the 
reader to Bing’s paper for the definition of F, and for the related notation. 
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We shall use Bing’s notation, adding one additional piece of notation of our 
own: If p € P, and if r is a finite subcollection of Q, then 


(pr) = tf € F|\ f@ =f,@) forallg € 7}. 


We must show that every point-finite covering of F has a locally finite 
refinement. So let W be a point-finite covering of F. Let ¥ = {U € ZIUN 
F, # }. There are now two possibilities: 


(a) ¥ is countable. Let V = U{V|V € ¥ }. Then Vis an open and closed 
subset of F, and is therefore normal. Hence ¥ is a countable, point-finite 
covering of the normal space V, and hence (by Theorem 1) has a locally finite 
refinement #. If we now let “= AU {{fi|f € (F — V)}, then 7 is a 
locally finite refinement of %. 


(b) ¥ is uncountable. We shall show that this is impossible. For suppose it 
is true. Then, it is easy to check, there exists an uncountable subset M of P, 
and for each p € M a finite subcollection r, of Q, such that the family of all 
(p, Ap), with p € M, is point-finite. Bing’s proof that F is not collectionwise 
normal actually proves that such a family cannot be disjoint: the proof that 
it cannot even be point-finite is very similar, and we therefore only indicate 
the necessary modification in Bing’s proof. Bing begins by assuming that the 
collection of all (p,7,) is disjoint, and obtains his contradiction by finally 
showing that it isn’t even point-finite. The only place where Bing actually 
uses the disjointness of {(p, 7,)}, « is, essentially, to show the existence of 
an uncountable W,’ C W (where W is an uncountable subset of M), and 
aqi € Q, such that gq, € 7, for every p € W,;’. To show the existence of q: 
and W,’ even under the weaker assumption that { (p, r,)}, € 4 is point-finite, 
we proceed as follows: Let T be a maximal subset of W having the property 
that r, (\r,’ = @ whenever p € T, p’ € T, p # p’; the existence of such a set 
follows from Zorn’s lemma. It is easy to see that 


N rd, T) ¥~ >, 
and hence T must be finite. Letting r = U,.7 1,, we see that 7 is a finite 
subcollection of Q. Now for every g € r, let E, = {p € Wlq € 1,}; it follows 
from the maximality of T that U,., E, = W. Hence E, must be uncountable 
for at least one g € r, say for q:. If we now let W,;’ = E, , then W,’ and qi 
have the required properties. 

To obtain a space, satisfying our requirements, which is also perfectly 
normal (i.e., every closed subset is a G;), we need only replace the above 
space F of Example G of (1) by the space F of Example H of (1). The proof 
goes just as before. 


Example 2. We shall construct a normal, non-collectionwise normal space G, 
every covering of which has a point-finite refinement. (This last property is 
sometimes called pointwise paracompactness.) This space certainly has all 
required properties, since if every point-finite covering of G had a locally 
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finite refinement, it would follow that G is paracompact, and hence collection- 
wise normal, which it is not. 

To obtain G, we begin with the space F of Bing (1, Example G) which was 
used in Example 1, and then let G be the subspace of F defined by 


G=F,U {f € F|f@ = 0 except for finitely many g € Q}. 


Since G is a closed subset of F, G is normal. Bing’s proof that F is not collection- 
wise normal goes through verbatim to show that G is not collectionwise normal. 
All that remains to show is that every covering of G has a point-finite refine- 
ment. 

Let % be a covering of G. For each p € P, pick a U, € Wsuch that f, € U,, 
and let V, = {f € G|f(p) = 1}. It follows from the definition of G that 
{V,}> «2 is point-finite. If we now let 


W= ({We A Uslow JU {1 n0-70), 
then W is clearly a point-finite refinement of Y. This completes the proof that 
G has all the required properties. 

Just as in Example 1, we can obtain a space, satisfying all our requirements, 
which is also perfectly normal. In fact, all we need to do is to start with the 
space F of Example H of (1), rather than with the space F of Example G of 
(1). We then let 


G= F,U {f € F| f(q@ is even except for finitely many g € Q}. 
The proof that G does the trick proceeds just as before. 
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ON DYNAMICAL SYSTEMS WITH ONE DEGREE 
OF FREEDOM 


Cc. R. PUTNAM 


1. Introduction. Consider the (vector, m-component) system of differential 
equations 


(1) x’ = f(x) (' = d/d2), 


where f(x) is of class C'. Let 2 denote a set of points, x, consisting of unrestricted 
solution paths x(t), so that the x(t) exist and lie in Q for — » <it< o. 
Let ¢ = t) be arbitrary but fixed; then the solution x = x(¢) will be called 
stable (with respect to Q) if for every « > 0, there exists a 6 = 6, > 0 such 
that |x(t) — y(t)| < « whenever y(#) is in Q and |x(to) — y(to)| < 6. For a 
discussion of this type of stability (called A-stability in (7)), see Liapounoff 
(4, pp. 210-211; 8, pp. 98-99), wherein are given references to Minding and 
Dirichlet. 

It was shown by Hartman and Wintner (3) that a solution x(¢) of (1) which 
is dense on a compact set 2 is almost periodic (in the sense of Bohr) if and 
only if it is stable in a certain sense. The type of stability considered there 
(called B-stability in (7)), however, is more restrictive than the A-stability, 
and, in the sequel, the term ‘“‘stability’’ will refer only to that (A-stability) 
defined at the beginning of this section. It was shown in (7) that if (1) is of 
the incompressible type, so that 


(2) div f = Ldf,/dx, = 0, 


and if the space is suitably restricted, then all stable solutions of (1) do have 
certain properties possessed by almost periodic solutions. Whether all such 
solutions, for m arbitrary, are actually almost periodic will remain undecided. 
(If (2) is not assumed, then stability surely does not imply almost periodicity 
even if the dimension number m of (1) is unity; see (7).) 

The present paper will be devoted to a consideration of (1), subject to the 
(measure-preserving) condition (2), in the special case when m = 2. It is known 
that the system (1) is then equivalent to a conservative Hamiltonian system 
(8, p. 88), and hence, in view of the existence of the energy integral, is com- 
pletely integrable. In what follows then, only Hamiltonian systems of one 
degree of freedom, that is, systems of the type 


(3) bp’ = — dH/dq, 7 = 0H/dp (H=H(p,q), * = d/dt), 
will be considered. The identification with (1) isx; = p,x2 = ¢,f1 = — 0H/dq, 
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fo = 0H/dp; the assumption that f(x) of (1) be of class C' is now that the 
Hamiltonian H(p, q) be of class C*. The following theorem will be proved: 


I. Let H(p, q) be of class C* on the p, q space, and let 2 denote an unrestricted 
invariant set of finite positive measure. Then through every point x = (p,q) of 
Q, except possibly for those belonging to a set Z of measure 0, there exists either an 
equilibrium solution (x = const.) or a periodic solution of (3). 


It will turn out that the set Z is the set excluded from the assertion of the 
Poincaré recurrence theorem on Q (see §3 below). It was shown in (7), however, 
that, even for general systems (1) satisfying (2) with dimension number n 
arbitrary, if the set Q satisfies 
(4) meas 2S > 0, 


where x is an arbitrary point of 2 and = is any open sphere (disk, in the 
present case) with center at x, then no point of a stable path can belong to Z. 
As a consequence, there follows the theorem 


II. Under the same assumptions as in (1), along with the additional condition 


(4), every stable solution path x(t) = (p(t), q(t)) is periodic (possibly constant) 
m—- 2 <i<c @, 


Needless to say, the condition (4) is fulfilled if, for instance, the set Q is 
open or is the closure of 2n open set. 

It should be noted that the existence of a closed (Jordan) curve in the p, ¢ 
space of the form H = const. does not necessarily imply that this is the path 
of a periodic solution of (3). One need only consider the physical example of a 
simple pendulum oscillating with an energy just sufficient to raise the pendulum 
(asymptotically) to its greatest possible height, corresponding to a position 
of unstable equilibrium. 

For a general discussion of systems (1) when m = 2, see the series of papers 
of Poincaré (5), especially the one of 1885. It should be noted that the notion 
of stability considered there (5, pp. 167-172) is not that of the present paper, 
but what is sometimes termed stability in the sense of Poisson. In this connec- 
tion, compare the recurrence theorem of Poincaré (6, pp. 67 ff) cited at the 
beginning of §3 below. For further references to the case n = 2, see Birkhoff 
(1), in particular pp. 123-124, and Brown (2). 

Another corollary of I is 


Ill. Let H = H(p, q) be of class C* and suppose that the point (po, qo) is an 
isolated equilibrium point of the system (3). If (po, qo) is a stable point (that is, 
if the solution p = po, = Qo is stable), then it is either a (local) maximum or a 
minimum point of the funtion H(p, q). 


The question as to whether there is a theorem corresponding to III for the 
case of a conservative dynamical system with m degrees of freedom was 
pointed out by Wintner (8, p. 101) and will remain undecided. It is known 
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that the conclusion of III can become false if the restriction that (po, go) 
be an isolated equilibrium point is dropped (8, pp. 100-101). 

The proof of III as a consequence of Theorem I will be given in §2; the proof 
of I will be given in §3. 


2. Proof of III. Grant Theorem I. Since xo = (po, go) is a stable equilibrium 
point, there exists a sequence of invariant (and, if desired, open) sets contain- 
ing, and closing down upon, the point x» (Poincaré-Birkhoff criterion). Choose 
one of these sets and call it Q; it is clear that the assumptions of I are now 
fulfilled. 

Since x» is an isolated equilibrium point, it follows from I that through 
almost all points sufficiently close to x» there exist (non-constant) periodic 
solution of (3), corresponding to closed Jordan curves in the p,q space. 
Consider a sequence of such curves C;, C2, . . . , which, in view of the stability 
assumption on Xo, tend to the point x». Since the energy integrals H = const. 
of (2) are the equations of the solutions in the p, g plane, each of the curves 
C,, is a level curve of H. Hence the function H attains either a (local) maximum 
or a minimum value at at least one point, say x,, inside each (Jordan) curve 
C,. Clearly the x, are equilibrium points and satisfy x, x» as m— @. Since 
xo is an isolated equilibrium point, then x, = x» for m sufficiently large (and 
hence for some one value of m). Consequently, x» is either a maximum or a 
minimum point of H and the proof of III is complete. 


3. Proof of I. The assumptions on @ are those guaranteeing the validity 
of the Poincaré recurrence theorem (cf., in this connection, 6, pp. 67 ff.;8, 
p. 91; 7). Let Z denote the set of measure zero excluded from the assertion of 
this theorem, so that, if x» is not in Z, the solution path x(t) through x» has 
the following property: if @ is arbitrary, there exists a sequence of dates ¢,, 
where m = + 1, +2,..., such that 4, ~ © or — © according as n— © or 
— and x(t,) — x(f) as |m| — o. It will be shown that if x9 = (po, go) is 
in Q — Z, then the (vector) function x(#), where x9 = x(to), is periodic or 
constant (for — © <i< o). 

Suppose, if possible, that x(#) # const. and not periodic. Since x9 = x(to) 
is in 2 — Z, there exist values ¢, and points x, = x(t,) such that x, # x» and 
X_ —> Xo as n — @. Since x(t) # const., so that x is not an equilibrium point, 
the general existence theorem along with the attending continuity properties 
for solutions of ordinary differential systems (1), guarantees the existence of 
points X, on the curve x(t) which lie on the normal line to this curve at the 
point xo, and satisfy X, (xo) — x». Since the solution path curve x(¢) con- 
stitutes a branch of the locus H(p,q) = c, for some constant ¢, it is clear 
that the directional derivatives of H, taken along the tangent and the normal 
to the path x(t) at the point x», are zero. Consequently, the vector grad H 
is zero at this point; and so x» is an equilibrium point, in contradiction with 


the supposition at the beginning of this paragraph. This completes the proof 
of I. 
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THE PROPAGATION OF A PLANE SHOCK INTO A 
QUIET ATMOSPHERE II 


M. H. MARTIN 


1. Introduction. In a previous paper (3) the study of one-dimensional, 
unsteady flows, isentropic or anisentropic, was reduced to the integration of a 
Monge-Ampére partial differential equation 


(1) Evtnn — f°bp = Tp: 
For a polytropic gas, the specific volume 


(2) c= e(5- Se) les P p", = 1/¥, 
takes the form 
tT = 5(y)p", 


once the entropy distribution function S = S(W) is specified. A solution 
& = £(y, p) having been determined by one means or another, the actual flow 
is presented by the mapping 


(3) x= f {Gee + 7) dv + Eotnib, t= & (w= &), 


of the (¥, »)-plane upon the (x, ¢)-plane. This mapping carries the rectilinear 
network y = const., » = const., in the (y, )-plane into the curvilinear 
network of trajectories and isobars in the (x, #)-plane. 

A progressive condensation shock, carrying in back of it the values u, 7, p 
of the velocity, specific volume and pressure and moving into a quiet atmo- 
sphere where these quantities have fixed values uo, to, Po is governed by the 
shock conditions: 





(4.1) u = Uy + V (p — po) (ro — 7), 
(4.2) F = uo + rof/ P—L2, 
(4.3) om (y ox 1)p + (y + 1) po 


~ 8 y + 1p + (y — Iho’ 
where (4.2) gives the shock velocity. 

Once the entropy distribution function S(W) is selected, the determination 
of the motion of the shock into the quiet atmosphere and of the flow immedi- 
ately behind it, sets (3) a Problem of Cauchy for the controlling partial 
differential equation (1). 
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If, however, one is willing to forgo knowledge of the flow immediately in 
back of the shock, the motion of the shock into the quiet atmosphere can be 
determined without solving the Problem of Cauchy. In this paper we investigate 
the influence of the choice of the entropy distribution function S(w) upon the 
propagation of the shock into the quiet atmosphere. Among other things we 
find that the time required (2, p. 213) for complete decay of the shock may be 


finite or infinite, depending upon the selection of the entropy distribution 
function. 


2. The shock curve. If we substitute from (2) for r in (4.3) and prescribe 
an entropy distribution function S(y), the resulting equation 


i (y — 1)p + (y + 1)po ‘) 

5) oe = eae (- (y+ lp + (y— po” 

defines a curve y = ¥(p) in the (y, p)-plane. This curve transforms by (3) 
into the shock curve in the (x, ¢)-plane and we propose to deduce its parametric 
equations 


(6) x=x(p), t= t(p), 
for a given entropy distribution function S(y). Here » denotes the pressure 
immediately in back of the shock. 


From (3) we see that a curve y = ¥(p) in the (y, »)-plane is carried into a 
curve x = x(t) in the (x, ¢)-plane along which 





dx _ dy [at 
(7) ho +r ap| ap 
As a matter of fact this result is an immediate consequence of the relation 
(8) dy = pdx — pudt, 


i.e., of the principle of conservation of mass. 
We take (5) for the curve y = ¥(p), substitute in (7) for wu and dx/dt from 
(4.1) and (4.2), to obtain 


(0) dt _ dy (2="). 





dp dp pb — Po 
When 7 is eliminated from this by (4.3) we find 
= dy ( 2r0 ) d 
(10) t= JZ + lp + G— lps 


to reduce the determination of the function ¢(p) in (6) to a quadrature, once 
the function y = y¥(p) is fixed by the selection of S(w) in (5). 
To determine the function x(p) in (6), we write (4.2) in the form 


dx _ ne 
[utr tT —tJdp’ 
and substitute in here for dt/dp from (9). This yields 
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(11) x = top + uol, 


where y, ¢ are the functions of » defined in (5) and (10). This result may be 
checked by applying the principle of conservation of mass across the shock in 
the form 


dy = pdx — pudt = podx — pouolt. 
We sum up our results on the shock curve in the theorem below. 
THEOREM. For a shock moving into a quiet atmosphere in which the velocity u, 
specific volume rt, and pressure p have fixed values uo, To, Po, the shock curve 


in the (x, t)-plane is described parametrically in terms of the pressure p immedi- 
ately in back of the shock by 


nm ” dy /( 270 ) 
(12) x = tov + uo, ‘-fZ (y+ 1)p + (vy — 1)po a, 


in which the function ~(p) is defined implicitly by 


S(y) =a loe(r. i + i + * s Ar *)+ So, So = const., 


upon prescription of the entropy distribution function S(p). 








3. Shock decay. The problem of shock decay is of some interest (1; 2) 
and we shall make some remarks on how decay is affected by the choice of the 
entropy distribution function. 

First of all, for a shock to decay completely it is necessary and sufficient 
that the pressure p in back of the shock equal the pressure » in front of the 
shock. 

Starting with a pressure ~: > » in back of the shock, it is clear from the 
above theorem that complete decay will require a finite or an infinite time, 
according as the integral 


— dy 270 
(13) t(bo) — thr) = J dp (~ +1)p+- im) ” 





converges or diverges. 

To fix the ideas, let us take u» = 0, so that the gas in front of the shock is 
at rest, and assume that the entropy distribution function S(~) is chosen so 
that 


(14) -a<%<o for pi <P < Po. 


From the parametric equations (12) of the shock curve it is clear that x and ¢ 
both increase monotonically for decreasing » as shown in Fig. 1. 

If the entropy distribution function S(y) is taken so that dy/dp is finite at 
Po, or is infinite for p = p» to an order low enough so that the integral (13) 
still converges, the shock curve will end at a finite point P» in the (x, #)-plane. 
Under these conditions the shock decays after covering a finite distance in a 
finite time. 


| 
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Ficure 1. The shock curve for dy/dp < 0. 


On the other hand, if S(y) be chosen so that dy/dp is infinite for p = po 
and the integral (13) diverges, both x and ¢ become infinite as p decreases to pp. 
In this case the shock travels an infinite distance and requires an infinite time 
before it decays. 

Decay after travelling an infinite distance in a finite time or a finite distance 
in an infinite time is accordingly not possible. 

Let us write (5) in the form 


= (y — 1)p + (y+ pol 2") 7 
(15) S(¥) —- S=G le | {2 F 1) + (y — l)pe\ do , So = const., 


in which Sp may be interpreted as the specific entropy of the quiet atmosphere 
in front of the shock. Expansion of the second member in powers of p — po 
yields 


(16) S(v) — So = A(p — po)? +..., A = n(1 — n*)c,/12p,', 


and differentiation gives 








a7) Sv) = ol - 1") (= pe) 
a Pl(y — 1)p + (vy + Upoll(y + Dp + (y — 1)pol’ 
where S’(W) is the derivative of S(y). 

From (17) it is clear that (14) requires S’ < 0, i.e., that the entropy dis- 
tribution function be monotone decreasing. Furthermore the shock will require 
a finite time to decay if S’(y), expressed as a function of », vanishes to less 
than the third order in p — po; on the other hand, if S’(W) vanishes to the 
third or a higher order in p — po, the time for decay is infinite. 


4. Examples of shock decay. We illustrate the general principles in the 
previous section by simple examples. 
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For the first example, take 


S(y) = So — ¥, 
so that, from (16) 


¥ = —A(p— po) +..., — 3A(p — po)? +... 


from which and (13) it is clear that the shock decays in finite time. 
For the second example, take 


Sy) =Ss+y', 
for which 


- - d i = 
vA pe) +..., B= -34%G- py +..., 
and it is clear again from (13) that shock decay now requires an infinite time. 
More generally let us take 
k 


SW) = So -*, jaekex.... 


Clearly S’(y) < 0 for all stated k, and its expansion in powers of p — pp is 
SW) = —Bip — po)" +... 


where B is a positive constant. Thus S’(y) will vanish to an order less than 
three at po for k = 1,3, ... and the shock decays in finite time; for k = —1, 
—3,...S’(W) vanishes to a higher order than 3 at po and decay requires an 
infinite time. 
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